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Preface 



The 27th International Workshop on Graph-Theoretic Concepts in Computer 
Science (WG 2001) was held in Boltenhagen (Mecklenburg-Vorpommern) June 
14-16, 2001. It was organized by the Theoretical Computer Science group of 
the Department of Computer Science at the University of Rostock. The or- 
ganizers gratefully acknowledge the support by the German Research Com- 
munity (Deutsche Forschungsgemeinschaft - DFG) and the state Mecklenburg- 
Vorpommern. 

As was the case for previous WG workshops, this workshop was devoted 
to the theoretical and practical aspects of graph concepts in computer science, 
and its contributed talks showed how recent research results from algorithmic 
graph theory can be used in computer science and which graph-theoretic ques- 
tions arise from new developments in computer science. Moreover, the workshop 
gave an impression of the impact of computer science on the efficiency of graph 
algorithms and on structural aspects of graphs and graph decomposition. 

The workshop looks back to a remarkable tradition of 27 years; all previous 
WG conferences took place in the middle of Europe (Germany, The Netherlands, 
Italy, Switzerland, Slowakia). The participants of WG 2001 came from various 
countries such as Ganada, Ghina, Gzech Republic, France, Great Britain, Greece, 
Italy, Norway, Russia, Sweden, Syria, Taiwan, The Netherlands, U.S.A., and, of 
course, Germany. 

The program committee represented the wide scientific spectrum and the 
aims of the conference. In a careful reviewing process with at least four reports 
per submission, the program committee selected 27 papers from the submissions. 
All accepted papers were presented at the conference, and the referees’ comments 
as well as the numerous fruitful discussions during the workshop have been taken 
into account by the authors of these conference proceedings. Moreover, there 
were two fascinating invited talks by 

H.-J. Bandelt (University of Hamburg) on some connections between graph 
concepts such as median hulls and Steiner trees on one hand and molecular 
biology on the other hand, 

and by 

F. Meyer auf der Heide (University of Paderborn) on data management in 
networks, giving a survey on strategies for distributing and accessing shared 
objects in large parallel and distributed systems. 

It is our pleasure to thank all those who contributed to the scientific success 
of WG 2001: 

- all authors of submitted and of presented papers, and in particular the speakers 

- the referees and subreferees 

- the invited speakers 
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- the DFG 

- the state of Mecklenburg-Vorpommern, and last but not least 

- the organizers Katrin Erdmann, Thomas Szymczak, Ernst de Ridder, supported 
by Roswitha Fengler before and Tilo Klembt, Suhail Mahfud, and Hans-Jorg 
Schulz during the conference. 

August 2001 Andreas Brandstadt and Van Bang Le 
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Median Hulls as Steiner Hulls 
in Rectilinear and Molecular Sequence Spaces 
(Invited Presentation) 

Hans-J. Bandelt 

Departement of Mathematics, University of Hamburg, Germany 
bandeltOmath . uni-hamburg . de 



1 Introdnction 

The Steiner problem is to find a shortest connection of a finite subset X in 
a given metric space (M, d) . If the space meets some local compactness and 
connectivity conditions, then a solution, a Steiner minimal tree for X, exists 
jZ]. More generally, considering any graph-theoretic tree T with all nodes of 
degree < 3 labeled by elements of X (that is, an X -labeled tree), we may ask 
for a minimal length realization of T in (M,d), that is, for an embedding of 
the node set of T in M which extends the identity map of X and yields the 
smallest possible length of the image Steiner tree (relative to the labeled tree 
T). The embedded unlabeled nodes of T are called Steiner points. A Steiner 
minimal tree is then that realization which attains minimum length among the 
(finite) collection of A-labeled trees. Given the subset X, one is interested in 
restricting the search for Steiner points in the metric space (M,d). Any proper 
subset of M that is guaranteed to harbor at least one Steiner minimal tree for X 
is called a Steiner hull for X Particular interest attaches to Steiner hulls 

that are finitely generated in that they are determined by a finite subset S of 
M containing X and all Steiner points of some Steiner minimal tree for X ; this 
finite set S can trivially be turned into a Steiner hull by attaching one geodesic 
from M for each pair of its points. For the Steiner problem in rectilinear space, 
that is, a d-dimensional real space equipped with the metric associated with the 
1-norm, finitely generated Steiner hulls have been described ECS]. 

One of the motivations to study the Steiner problem in rectilinear space 
comes from biology, where one is interested in reconstructing the phylogeny or 
genealogy of species or populations based on molecular or morphological data. 
The Steiner problem is there referred to as the parsimony problem and Steiner 
minimal trees are then called most parsimonious reconstructions of most par- 
simonious trees. Actually, Farris [Hj was the first to describe an algorithm for 
finding a minimal length realization of any A-labeled tree within a rectilinear 
space (an ’’ordered character” in the biological context refers to one coordinate 
of the space). This was accordingly modified for molecular sequence spaces by 
Fitch [^; cf. m- Such a space (equipped with the Hamming distance) is, for 
some d > 0, the d-th Cartesian power of an equidistant alphabet (typically, the 
nucleotide alphabet). In the binary case, that is, say, for the alphabet {0, 1}, it 
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is knowm that median hulls generate Steiner hulls EE]. In fact, an algebraic 
analysis of the algorithm of Farris and Fitch (the FFH algorithm, see below) 
reveals that any X-labeled tree T has a minimal length realization as a Steiner 
tree relative to T using only Steiner points from the median hull of X. In practi- 
cal applications where this hull would still be too large, one could further reduce 
this set in a heuristic fashion, of course at the expense that the Steiner hull 
property may no longer provably be maintained mi- 

2 Reduction to Hypercubes 

Consider a tree T interconnecting a family of sequences from a finite subset X 
of the d-dimensional rectilinear space. Then one can realize T as a Steiner tree 
in the rectilinear space attaining minimal length relative to T [7j. Adjoin to X 
all Steiner points of this Steiner tree and extend the resulting set to a finite grid 
Y (of dimension at most d). This grid can be recoded by 0-1 sequences and thus 
be embedded into a finite-dimensional hypercube H in which every coordinate 
receives a certain weight. We can think of X being embedded in H such that a 
Steiner tree realization of T in id has the same minimal length as in the original 
rectilinear space. We will see below that the adjunction of a set of Steiner points 
was an unnecessary proviso since the median hull of X (necessarily contained 
in Y) turns out to harbor an ample collection of potential Steiner points to 
guarantee a minimal length realization relative to any prescribed tree T. 



3 Farris-Fitch-Hartigan (FFH) Labeling 

First, in order to describe and analyze the FFH algorithm, it is convenient to 
encode any given tree T in terms of its leaves (n > 2). One leaf, 

say to, is selected as a root. In dealing with the thus rooted tree T we employ the 
metaphor of a maternal genealogy and think of T as being visualized in the plane 
such that to is at the top and the other leaves are at the bottom. The rooted 
subtree T' ■=T — to obtained by deleting to (and the edge incident with it) has 
the (unique) daughter node uq of to as its root. The so-called New Hampshire 
encoding [TO] of T' is a string comprising brackets and the leaf names ti , . . . , t„_i . 
The code for T' is obtained bottom-up: if in the first step, say, ti, . . . , ti (i > 1) 
comprise all the daughter nodes of their common mother node u, then we record 
this by u = (ti, . . . ,ti) where the order within the bracket is arbitrary. Then 
delete t\, . . . ,U and turn u into a leaf, which represents the subtree with original 
leaves ti, . . . ,ti. Continue until the bracketing stops with a single node, the root 
Uq of T' . To arrive at the final code, successively replace every interior node by 
the associated bracket. 

We now turn the New Hampshire encodings of the subtrees into operations on 
the hypercube. First we deal with the case of a single coordinate which may take 
values 0, 1. In order to handle ties , we introduce a third letter N, which stands 
for ’’not determined”. For each fc > 1 we define the fc-ary majority operation on 
{0,1, N} as follows. The operation returns 0 if 0 occurs more often than 1 among 
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the k entries, it reurns 1 if 1 is more frequent than 0, and returns N otherwise, 
that is , in the case of a tie between 0 and 1. The outcome is then simply written 
as a bracket with the entries listed as a string. By definition, this operation 
is commutative. For fc = 2 we have (00) = (ON) = 0, (11) = (IN) = 1, and 
(01) = (NN) = N, and for fc = 3 we have (000) = (001) = (OON) = (ONN) = 0, 
(111) = (110) = (IIN) = (INN) = 1, and (OIN) = (NNN) = N. Restricted to 
{0,1} the latter entails the median operation on {0,1} |3|. The fc-ary bracket 
operation on the set {0, 1, N}‘^ of sequences of length d is then defined coordinate- 
wise. 

The FFH algorithm makes two passes through the tree T rooted at to, a 
bottom-up pass and then a top-down pass dnii3|. Starting from the labeled 
leaves ti, . . . any mother node with k children labeled by xi , . . . ,Xk re- 

ceives the label (xi . . .Xfc). Proceeding bottom-up this pass stops with the la- 
beling of the root uq of T' . In the top-down pass, beginning with uq, the un- 
determined (”N”) coordinates are successively specified by 0 or 1: a node of T 
with current label x such that the mother node is already labeled by a 0-1 se- 
quence y receives the 0-1 sequence (xxy) as its new label. This, in brief, is how 
the algorithm works. The (well-known) proof that this labeling yields a minimal 
length realization relative to T is straightforward by induction: interpret each la- 
bel of the bottom-up pass as a set of 0-1 sequences obtained by all combinations 
when inserting 0 and 1 for N; this set describes exactly the 0-1 sequences at the 
interior node under consideration each leading to a minimal length realization 
of the subtree rooted at this node; the final top-down pass then executes the 
minimization link- wise by using the specified 0-1 sequence at the mother node 
in question. 



4 Examples 

First consider the set X = {v,w,x,y} of length 7 sequences v = 1000111, w = 
0100100, X = 0010010, y = 0001001. Let T be the tree with four leaves to, ti, ^ 2 , ts 
and interior edge separating to,ti from t 2 ,t 3 - Then T' := T — to = (ti, (^ 2 ,^ 3 ))- 
When we label to, t\,t 2 ,to by v, w, x, y, respectively, then the root of T' receives 
the label {vw{wxy)) = 0000100 under FFH and the second interior node of T 
is then labeled by (wxy) = 0000000. When we assign to the leaves to,ti,t 2 ,to 
either w, v, x, y or x, y, v, w, or y, x, v, w instead, we obtain six further sequences, 
so that eventually all 0-1 sequences of length 7 with prefix 0000 are generated. 
The resulting set of 12 sequences (including X) is then closed under the median 
operation and constitutes the graph A(4) described in [S]. 

Next we consider the set X = {v,w,x,y,z} of length 15 sequences where 
the coordinates correspond to the partitions of X into two parts such that 0 
is always assigned to the majority part (for convenience). Assume that T is a 
tree with five leaves to,t\,t 2 , to, t^ which are labeled by v, w, x, y, z, respectively, 
unless stated otherwise. Let to serve as the root of T, and put T' := T — to- Since 
there are two binary rooted trees with four leaves, we distinguish two cases. 
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Case 1: T' = ((^ 1 ,^ 2 ), (^ 3 ,^ 4 ))- Then the root of T' receives the label 
{vwxyz) = {v{{vwx)yz){{vyz)wx)) 

by the FFH algorithm. Its daughter node that is the mother node of ^3 and t 4 
then gets the label 

{vwxyyzz) = {{vwx)yz). 

When we permute the labels, the root label always remains the same, but the 
mother node of and f 4 can receive nine further possible labels. 

Case 2: T' = ,ti) . Then via FFH the root of T' carries the label 

{vvwxyyzzz) = {{{wxy)yz)vz), 

its interior daughter node 



(wxyyz) = {{wxy)yz), 
and the interior granddaughter node 



(wxy). 

Reshuffling the labels will produce altogether 30 label candidates at the root, 20 
at its interior daughter node, and 10 at its interior granddaughter node. 

In either case and under each permutation of labels, we obtain a Steiner 
minimal tree (of lenghth 23), where each time the FFH algorithm selects three 
Steiner points from a pool of 71 sequences. 

The FFH algorithm can in fact contract further sequences from X when one 
prescribes a tree with more than five leaves (and thus multiple occurrences of 
labels), which would no longer lead to a Steiner minimal tree for X. Specifically, 
pick a tree with six leaves (z = 0, . . . , 5) such that T—to = ((((ti, t 2 ), ^ 3 ), ^ 4 ), ^s) 
and to, to are labeled by z, w, x, v, y, v, respectively. Then the root of 

T — to receives 

(vvvwxyz) = {{{vwx)vy)vz). 

Permutation of the labels gives four additional sequences. The original five se- 
quences of X together with the former 71 Steiner points and the latter five se- 
quences form a set closed under the median operation and constitute the nodes 
of the graph A(5), see [5]. We will see below that this set exhausts all possible 
interior labels obtainable via FFH from any tree with leaves labeled by sequences 
of X. 

5 Median Hull Includes FFG Hull 

The median hull of a finite subset X of a rectilinear space or a hypercube is 
the smallest (necessarily finite) set containing X and closed under the median 
operation which assigns to any triplet u,v,w its median {uvw). This hull can 
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be constructed by successively adding medians until no further new points (se- 
quences) arise. 

Now, regarding X as a subset of the fc-dimensional hypercube H. One can 
easily check whether a particular sequence a\ . . .ak from H belongs to the me- 
dian hull of X: a, necessary and sufficient condition is that each truncation (alias 
projection) UiUj of the sequence to two coordinates belongs to the correspond- 
ing truncation of X, that is, agrees at coordinates i and j with some sequence 
of X [^. To see this, apply induction on the sequence length. Assume that 
0102 . . . Ofc-iOfe meets this criterion on coordinate pairs. Then by the induction 
hypothesis the two truncated sequences 0i02 . . . ak-i and 02 . . . Uk-iau belong 
to the median hull of the corresponding truncation of X, and the length 2 se- 
quence oiOfc belongs to the truncation of X to coordinates 1, k. Hence there exist 
bi, . . . ,bk from {0, 1} such that 0i02 . . . ak-ibk, 6102 . . . ak-iUk, 0162 ■ • ■ ^fc-iOfc 
belong to the median hull of X. Since {aiUibi) = Ui for all i, we obtain oi . . . Ofc 
as the median of these three sequences, as required. 

In order to show that the FFH algorithm produces only labels from the 
median hull of X, we may therefore assume that the leaves of the leaf-rooted 
tree T under consideration are labeled by 0-1 sequences of length 2. If all four 
sequences occur at the leaves, then there is nothing left to show. So, assume 
that no leaf receives label 11, say. We claim that after the bottom-up phase IN, 
N1 and 11 never label any interior node. Argue by induction from the bottom- 
upwards. Assume that the interior node under consideration has 1 as the first 
label coordinate. This means that more daughter nodes have 1 than 0 at the 
first coordinate. Then by the induction hypothesis the former daughter nodes 
must have 0 at the second coordinate, whereas only the latter daughter nodes 
may have 1 as the second coordinate. Hence the mother node necessarily receives 
label 10, which is fine. Finally, in the top-down phase the feasible labels ON and 
NO are specified as either 00 or 01 and 10, respectively, while NN is replaced 
by the label at the mother node, which is different from 11 (by employing a 
top-down induction argument). 

We conclude that all labels erected by the FFH algorithm lie in the median 
hull of X. 

6 FFH Hull Includes Median Hull 

Now that we know that the FFH labeling stays within the median hull of the 
given set X of binary sequences we may ask whether it actually exhausts the 
latter: can every sequence in the median hull of X be obtained as a FFH label 
of some interior node of a suitable tree T rooted at some leaf Yes, of course. 
To see this, use induction on the number of median operations employed in 
generating a particular sequence v of the median hull of X. Assume that v = 
(U1U2U3) where each Ui is the FFH label of the daughter node of a leaf-root ti 
of some tree Ti, independent of the label from X attached to U. In other words, 
the FFH algorithm already fixes the labels (as a 0-1 sequence) at the root of 
the tree T/ := Ti — ti in the bottom-up phase. Stipulating that all rooted trees 
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are identified with their New Hampshire codes, the rooted tree T' := (T(, T2, Tg) 
then does the job, no matter how we label a new leaf t attached to the root of T' 
in order to arrive at the required tree T . Note that this recursive construction 
yields a ternary tree T' . If instead a binary tree T' is desired, one simply has to 
modify the above definition of T' by setting T' := (((T{,T2), (T{,Tg)), (T^.T^)). 
In fact, the root of T' then receives the label {{{uiU2){uiU3)){u2U3)) = {U1U2U3) 
as before. To verify the latter inequality, it suffices to assume ui,U2, U3 are 0 or 1 
and to consider the two cases where either U2 = u.3 holds or iti = U2 is different 
from U3. 

7 Conclusion 

The rectilinear Steiner problem reduces to the Steiner problem in weighted hy- 
percubes. The median hull, generating a universal Steiner hull, then assists in 
focussing the search for Steiner points. This hull is universal in the sense that, 
for whatever tree T is considered that interconnects a family of given sequences, 
one can always achieve a minimal length realization of T within this hull. In par- 
ticular, the median hull also generates a Steiner hull for the restricted Steiner 
problem where the number of Steiner points is bounded from above by a fixed 
number k, thus solving a problem of m p. 213 ]. In this case one actually needs 
only a minor part of the median hull of the set X of sequences to be optimally in- 
terconnected. For instance, when k = 1 , the Steiner point is always found among 
the ( 2 i + l)-ary medians {x\ . . .X2i+i) of different sequences from X. But even 
if one searches for a Steiner minimal tree in the unrestricted setting, a Steiner 
hull may be generated from a proper subset of the median hull (when ^X > 4 ) 
because different nodes then necessarily carry different labels. 

For non-binary sequences the FFH algorithm in its original formulation does 
not determine the labels uniquely, unless one specifies a tie-breaking rule. This 
can in fact be achieved with a priority ordering of the tree nodes (”pFFH al- 
gorithm”). With arbitrary molecular sequences the pertinent hull is the quasi- 
median hull [Tj; one can then show that the quasi-median hull and the pFFH 
hull of a finite set X in a molecular sequence space coincide (Bandelt and Rohl, 
in preparation). 
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Abstract. We consider an edge-isoperimetric problem (EIP) on the 
cartesian powers of graphs. One of our objectives is to extend the list of 
graphs for whose cartesian powers the lexicographic order provides nested 
solutions for the EIP. We present several new classes of such graphs that 
include as special cases all presently known graphs with this property. 
Our new results are applied to derive best possible edge-isoperimetric 
inequalities for the cartesian powers of arbitrary regular, resp. regular 
bipartite, graphs with a high density. 



1 Introduction 

Let G = (Vg, Eg) be a graph and A, B C Vq- Denote 

Ig{A, B) = {(u, v) £ Eg \ u £ a, v G B}, 

Ig{A) = Ig{A, A), 

lG{m) = max |/g(A)|. 

ACVc,\Al=m 

We will often omit the index G. Our subject is the following version of the edge- 
isoperimetric problem (EIP): for a fixed m, 1 < m < \Vg\i find a set ACVg such 
that |A| = TO and |/(^)| = I{m). We call such a set A optimal. This problem is 
known to be NP-complete in general and has many applications in various fields 
of knowledge, for instance see a survey |4]. 

We emphasize on graphs representable as cartesian products of other graphs. 
Given two graphs G = {VctEg) and H — (Vh,Eh), their cartesian product is 
defined as a graph G x H with the vertex-set Vq x Vh whose two vertices (a:, y) 
and {u,v) are adjacent iff either x = u and {y,v) £ Eh, or (x,u) £ Eq and 
y = V. The graph G" = GxGx---xG(n times) is called the cartesian 
power of G. 

The EIP for the cartesian powers of a graph G have been well studied for 
different graph classes. To present these results we need the definition of the 

* This work was partially supported by the German Research Association (DEG) 
within the SFB 376 “Massive Parallelitat: Algorithmen, Entwurfsmethoden, An- 
wendungen” 
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lexicographic order on a set of n-tuples with integral entries. For that we say that 
{xi, . . . , Xn) is greater than {yi, . . . , j/„) iff there exists an index z, 1 < * < n, such 
that Xj = yj for 1 < j < i and Xi > yi- Assuming that the vertices of a graph G 
are totally ordered, denote by lF"(m) the collection of the first m vertices of G" 
in the lexicographic order. We call such a set an initial segment. Let G be the 
cartesian product of n complete graphs. Then the following Theorem holds. 

Theorem 1. (Harper |2] for pi = . . . = = 2, Lindsey |H] in general) Let n>2 

and Pi > P 2 > ■ ■ ■ ^ Pn- Then |/(lF”(m))| > |d(A)| for any A C Kp.^ x • • • x Kp^, 
|A| = m, and any m > 1. 

In other words, what is more commonly said, for the products of cliques 
there exists a nested structure of solutions, i.e., a family of optimal subsets 
Ai, A2 , . . . , Am, ■ ■ ■ such that \Am\ = rn for any m and Ai C A2 C • • • C Am C 
• • •. We call an order providing the nested structure of solutions an optimal 
order (if such exists). Existence of a nested structure of solutions provides as an 
immediate consequence solutions to such important problems as cutwidth and 
wirelength, and often allows to construct good /c-partitioning of graphs and their 
embedding to some other graphs [1] . 

The classical results presented in Theorem [1] can be extended in various 
directions. For instance, taking an (infinite) path instead of a clique leads to a 
grid. In this case the EIP also has nested structure of solutions m, although 
the optimal order Q is not the lexicographic one. It is further shown in [1] that 
the order Q is also optimal for the products of arbitrary trees. For the definition 
of this order and further details, readers are referred to mm- It is also known 
that no optimal order exists for the powers of cycles of length 5 and larger. 

The order Q and the lexicographic order were for a long time the only ex- 
amples of optimal orders with respect to the EIP. However, as shown in [4], the 
order Q is optimal for products of trees only. Therefore, two natural questions 
arise: (i) for what other graphs is the lexicographic order on the power optimal 
with respect to the edge isoperimetric problems; and (ii) what other optimal or- 
ders can one expect? In j^, we studied the cartesian powers of the Petersen 
graph and presented a new optimal order for them. However, it looks like this 
order is optimal for a rather narrow class of graphs. This, in the light of the 
results presented above, motivates to study further the lexicographic order. 

Following this direction, Ahlswede and Cai proved in that the lexico- 
graphic order is optimal for the products of complete bipartite graphs Kp p. 
They also showed in [2] that this order yields the so-called local-global principle. 
That is, if the lexicographic order is optimal for G^ then it is optimal for G" 
for any n > 3 (see [1] for the local-global principles for some other orders). The 
main difficulty in applying this powerful result is to establish the optimality of 
the lexicographic order for the second cartesian power of a graph. Unfortunately, 
no efficient methods have been developed and the known approaches are very 
specific. In our paper we present new ideas in this direction and apply them for 
three new graph families. These new graphs include, in particular, all presently 
known graphs such that the lexicographic order is optimal on their cartesian 
powers. 
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The paper is organized as follows. In Section 2 we introduce new graph 
classes Hp i and Bp i and present some basic definitions and lemmas which are 
used in the sequel. A motivation for studying these graph classes is provided by 
Lemma [T] which establishes best-possible isoperimetric inequalities for powers 
of arbitrary regular, resp. regular bipartite, graphs. In Section 3 we prove the 
optimality of the lexicographic order for the cartesian powers of Hp^i. The ap- 
proach used in this proof also works well for the powers of complete bipartite 
graphs and complete t-partite graphs and generalize the result of Ahlswede and 
Cai 1^ concerning the complete bipartite graphs. Concluding remarks and open 
questions in Section 5 complete the paper. 

2 Definitions and Auxiliary Lemmas 

Here we introduce three new families of graphs, for whose cartesian powers the 
lexicographic order is optimal. The graph can be constructed from K 2 p by 
partitioning its vertex set into two parts V and V with |R'| = \V'\ = p and 
removing i disjoint perfect matchings between these sets. More exactly, 

=V'vjV", C' = {0,1,...,p-1}, V” = {p,p+l,...,2p-l}, 

EHp^i = {{u,v) \ u,v € V\ or u, u £ V"} U 

{(u, u) I M G V' , V G V” , u + v ^ {2p — j) mod 2p, j = 1, . . . , i}. 

An example of is shown in Fig.[TJa). The vertices belonging to V and V” 
are put into ovals. Note that various ways of deleting i perfect matchings from 
K^p may lead to non-isomorphic graphs. However, we do not worry much about 
that, because the function J(m) for all these graphs is the same, as it follows 
from the proof of Lemma [Sj This function is mostly important for our approach 
rather than the structural properties of the underlying graphs. Another graph 
we study in our paper, is the one obtained from K^p^ip for an even p by 
removing i disjoint perfect matchings (cf. Fig. [IJb) and the conclusion for a 
precise definition). The lexicographic order is also optimal for Bp^ for any n> 2 
and * < This can be proved by following the lines of the proof of Theorem 
Elin the next section. The interest to these graphs is provided by the following 
lemma, whose proof is presented at the end of this section after establishing 
some auxiliary results. 

Lemma 1. 

a. Let G be a regular graph with \Vg\ = VHp,t cind deg(C?) = deg{Hp^i), that 
admits an optimal order in the EIP. Then Ih'‘ . (m) > /g" (m) for any n > 1 
and m = 1, . . . , (2p)”. 

b. Let G be a regular bipartite graph with both independent sets of size p (p 
even) and deg(G) = deg(Bp i), that admits an optimal order in the ELP. 
Then Lgn (m) > iG^(m) for any n > 1 and m = 1, . . . , (2p)". 



12 



Sergei L. Bezrukov and Robert Elsasser 



2 5 




a. 

6 4 3 1 




b. 



R2 



R4 




Fig. 1. Representations of i?3,i (a), i?4,2 (b), and (c) 



This provides best-possible isoperimetric inequalities for powers of regular, 
resp. regular bipartite, graphs. The class of regular graphs is important, because 
in the light of the isoperimetric problem for regular graphs is equivalent to the 
problem of minimization of |6*(A)|, where 0{A) = I{A,A) and A = Vg \ A. The 
function 0{A) appears in most applications of the edge-isoperimetric inequalities 
For a non-regular graph G a solution to the EIP on G provides a lower bound 
for this function following from the identity /(A) U/( 2 l) \J9{A) = Eq- Moreover, 
we conjecture that the regularity of G is important for the lexicographic order 
to be optimal for G" (cf. the conclusion). 

The third graph class we consider here is given by the complete t-partite 
graphs Kp p p. It can be also shown that the lexicographic order is optimal 
for its cartesian powers too. This extends a result of [2| for complete bipartite 
graphs. We also develop a new method for solving the EIP for the product of 
two graphs. This method works well for all graphs mentioned above. 

We will need the following two auxiliary propositions for the sequel. 

Lemma 2. Let G = (Vg,Eg) be a regular graph and A C Vg- Then A is an 
optimal set iff A = Vg \ A is optimal. 

Proof. Let deg(u) = k for any v € Vg. Then 

2\I{A)\ + \I(A,A)\ = k\A\ (1) 

2\I{A)\ + \I{A,A)\=k\A\ = k(\VG\-\A\) 

The above equations imply 2\I{A)\ = fc(|VG| — 2\A\) + 2\I{A)\. Thus, |/(^)| and 
|/(A)| differ on an additive constant for any A C Vg- Hence, if one of these sets 
is optimal, then the other one is optimal too. □ 
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Denote V = ^ for brevity and let J~(m) = {0, 1, . . . , m — 1} for m = 

Lemma 3. \I{T{m))\ > |/(^)| for any ACV with |A| = m. 

Proof. The assertion is obvious if m < p, because Hp^i contains Kp, whose 
vertices are numbered first. Now if m > p then the assertion follows from Lemma 

E] □ 

Let n > 2 and A C f/". For 1 <i < n and x = (xi, . . . , Xn) G denote 

Ai(x) = {y I {xi,...,Xi-i,y,Xi+i,...,Xn) € A}. 

The set A <1 \s called compressed if ^^(x) = {0, 1, . . . , |Ai(x) |} for any 
i = and any x G A. Standard arguments (see |2I4| 1 provide that if a 

graph G admits an optimal order, then for any m > 1 there exist compressed 
optimal sets in Vg". This observation is a key point for the proof of Lemma [TJ 

Proof of Lemma 171 

For a graph G from the statement and j = 1, . . . , |Vg|, let us denote 

^gU) = IgU) - ^gU - with Z\g(0) = 0. 

Taking into account that G admits an optimal order, denote the vertices of G by 
0, 1, . . . , 2p — 1 according to their positions in this order. This provides Vq = V. 

We prove the first proposition of the lemma by induction on n. For n = 1 
we can use the same arguments as in the proof of Lemma IH Note that loinT-) = 
^gU) for any graph G that admits an optimal order. 

Now for n > 2 let A C with |A| = m be an optimal compressed set. 
Then A can be viewed as a subset of V^. Note (see e.g. [2] for ([2l)) that for any 
compressed set A C Vg" = one has 



\Ig^A)\ 



i=l 



P-1 

E 

3^0 



E 

{j,X2,-.-,Xn)^A i=2 



+ E ^g{xi). 

{xi,...,Xn)^A 



(2) 

(3) 



Similar formulas are also valid for compressed subsets B C = V^. Note 

p,i 

that A and B are the subsets of the same vertex set. Set B = A. Now for any 
fixed j in the induction hypothesis implies 



E E Aaixi) < E E (4) 

(j,X2,-..,Xn)GA i=2 (j,X2,-..,Xn)GB i=2 

Furthermore, for a fixed x G ^ and a compressed set A = B (with Ai(x) = 
Si(x)) Lemma [3] implies 

Y 2iG(a;i) = /g(|Ai(x)|) < /if„,(|Bi(x)|) = Y 2\Hp,i(xi) (5) 

ailGAi(x) xi€Bi{x) 
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Summing up @ for j = 0, ... — 1 and 

(xi, X2 , . . . ,Xn) € A and using @ one gets 



7 g" (m) = 1 /g" (A) I < \Ih^ . (B)| < max 

P’* SCV", |S|=m 



for X2t ■■ -I Xn such that x = 



and the induction goes through. 

The second proposition can be proved similarly by taking into account that 
an analog of Lemma [3] is valid for the graph Spy. □ 

By using a similar argument the following result can be proved, that can 
be considered as a best-possible edge-isoperimetric inequality for the cartesian 
powers of general graphs. 



Corollary 1. Let G be a graph with \Vg\ = P that admits nested solutions in 
the EIP. Then Ik^ > for any n > 1 and m = 1 , . . . ,p". 



3 EIP for the Cartesian Powers of i?p,* 

One of the main results of the paper is formulated in the following theorem. 

Theorem 2. // p > 3 and 1 < i < p — i then |/(lF^(|A|))| > |/(A)| for any 
ACV xV. 

Proof. Let A CV xV he an optimal compressed set non-isomorphic to 1F^(|A|) 
(if such exists). Denote 

Vi={{x,y) I 0 < a: < p, 0 < y < p}, 

V2 = {(x,y) jo<x< p, p<y< 2p}, 
ks = {(x,y) I p < a; < 2p, 0 < y < p}, 

14 = {(a;, y) I p < a; < 2p, p < y < 2p}, 

and let Ai = AHVi, i = 1 , 2, 3,4. Obviously, for any i the subgraph induced by 
the vertex set Vi is isomorphic to Kp x Kp. These sets are shown in Fig. [T](c) for 
the graph i7| ^ . The graphs induced by vertices located in one row (or in one 
column) in this figure are isomorphic to i7py. We will often use the fact that any 
vertex of V2 (and V3) is connected with p — i vertices of Vi and versa. In Fig. 
[He) such connections are shown for the vertex (2,2) S Vi. 

Case 1 . Assume A C Vi U V2. Since A is compressed, then |Ai| > IA2I. Thus, 
due to Theorem [T] we can assume that Ai and A2 are initial segments of Vi and 
V2 respectively (see Fig. |2J. Let |Ai| = kip + and IA2I = k2p + 82 for some 
integer fci,fc2, <51,^2 with 0 < k\,k2 < p — 1 and 0 < <5i,i52 < p. If |Ai| = p^, 
we assume ki = p — I and = p. Now |Ai| > IA2I implies k\ > k2- Moreover, 

if k\ = k2 then Si = p or 82 = 0 due to the compression, hence A = T'^{\A\). 
So we can assume k\ > k2- Case la. Assume k\ — k2 is even (and positive). 
First consider the case <5i -I- ^2 < P- We transform A into B = 1F^(|A|) in two 
steps. First, we construct a set C by moving (^2, y) € A2 for p < y < p + 82 to 
{ki,y + Si — p) G Vi \ Ai. It is easily shown that 



|7(C)| - |7(A)1 = ^2(5 i + fci - fc2) - <52 (p - i). 



(6) 
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Fig. 2. Compression of A in case 1 and transformation for + <52 > p in case la. 



Denote Ci = C H Vi, Di = D n Vi and Bi = B n V, for i = 1, 2, 3, 4. Now we 
move (x, y) G C\ with < x < k\, Q < y < p to {x — {k\ — fc2)/2, y + p) £ 

V 2 \C 2 and (x,y) G C\ with x = ki,Q <y < i5i+(52 to ((fci + /ci)/2, y) G Vi\C'i. 
For the resulting set D one has 

|7(D)| - 17(01 - (7 - i - - (A + (7) 

Note that B 2 = D 2 and |Bi| = |Di|. Since the subgraphs induced by V\ and 
V 2 are isomorphic to K 2 x K 2 , then \I{D) \ = \I{B)\. Therefore, from and © 
for A — \I{B)\ — |/(A)| one gets 



p{ki -k2) ( . ki - k2\ ^ f , ki- k2\ ki - k2 ^ , , 

= \^p-t — j +S 2 + — - — j - Si — S2{p-i). ( 8 ) 

Since i < p — i, then p/2 > i. Furthermore, k 2 > 0 and fci < p — 1 imply 
^1 — ^2 < P — 1- These inequalities imply p — i> p/2 > (p — l)/2 > (fci — k2)/2. 
Therefore, p — i — {k\ — k2)/2 > 1, and from (|H) it follows 



> 



{p-Si){ki 

2 



^-52+5iS2. 



(9) 



Since p > Si and <52<5i > <52 for Si ^ 0, then Z\ > 0. If i5i = 0, then taking into 
account fci — ^2 > 2 and ®, we get 



A>"^-^^^^^-S2>p-S2>V 

Now consider the case i5i + 1^2 > p. Then ^2 > 0. We transform A into 
B — in three steps. First construct the set C by moving p — 5i vertices 

of the form (fc 2 , y) € A 2 for 5i + 82 < y < 82 + p to {ki,y — 82 ) G Vi \ t4i. One 
has 



|7(C)| - |7(t4)1 = (p - 5i)(5i +ki-k 2 - [Si + S 2 -p)-{p- i)). 



(10) 
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Next, we move the vertices {k2,y) G C2 ior p < y < 5 \ + 52 to {(k\ + k2) / 2 ,y). 
For the resulting set D one has \I{D)\ = |/(C')|. Finally, we move {x,y) G Di 
with < X < ki and 0 < y < p to {x + 1 — {ki — k2)/2,y + p) € V2 \ £^2- 

The resulting set is B. One has 

|/(B)| - \I{D)\ = (p _ i _ (11) 

It follows from (ITOl l and ([TT| that for A = \I{B) \ - |/(A)| one has 



zi = 



p{ki - fe) 



\p-i- 



fci — ' 



-<5i 



fei — k2 



+{p — 5i)(5i — ^2 + (p — i5i) — (p — i)) + ^2 — 

p(fci - k2) 



fci — ko 



p — i — 



+{p — Si)(i — S2) + S2 



ki - k2\ 
2 ) 
ki — k2 



-Si 



ki — k2 



(12) 



Similarly as above, p — 1 — {ki — k2)/2 > 1. Now if j > 82, then 



p{ki - k2) f 
2 



ki- k2\ fci - . (p - 5i)(fci - fe) - „ 

2 -0 



(13) 



which implies Z\ > 0. 

Note that p — <5i < 82- Thus, if i < <52 and (fci — /c2)/2 >82 — 1, then (IT^ and 
(IDTt imply 



p(fei - fe) / , ki - k2\ „ fci - fc2 ^ ^ ki - k2 

A> 2 [p-^-^^)-Sl^^-S2{S2-^)+S2^- 

^ P{kl-k2) (^^_._ kl-k2 ^_^ ki-k2 
Now \i i < 82 and 82 — i > {ki — k2)/2, then rr2li implies 



Z\ = 



p{ki-k2)( . ki - k2\ , . /c 

— 2 — r-*- 

p{ki - fe) / . ki-k2\ ; ( 

2 [v-^-^^)- 52 [p-^- 

+ 5 i (^2 — i— 2 



ki -k2\ , j 

-^)-p {52 
ki - k2\ 

2 J 



*) 



+ ^2 



ki — k2 
2 



( 14 ) 



Since p > 82 and k\ — k2 > 2, then the last term of dn) and the term in 
brackets are positive. Therefore, Z\ > 0. 

Case lb. In this case we assume that ki — k2 — 1 is even. The proof of this 
case is similar to case la. and it is omitted because of space limitations. 

Case 2 . Assume A C Vi u V2 0 V3 and A2 yf 0 and A3 yf 0 . Without loss 
of generality we can assume that A2 is an initial segment (in “columns” of V2) 
and so is A3 (in “rows” of V3) (cf. Fig. E}. Indeed, after such a transformation 
the number of inner edges in A2 and A3 cannot decrease by Theorem [I] and 
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Fig. 3. Compression of A in case 2 



the number of edges between Ai and A2 (also between Ai and 2I3) remains the 
same. Let IA2I = k2P + S2 and I2I3I = k^p + S3 with 0 < 1^2, < p. We can 
assume k2 > ks. Now we consider an (p — ^2) x (p ~ ks) rectangle V5 C Vi with 
V5 = {(x,y) G Vi I ^2 < a; < p, ks < y < p} and replace ^5 = A fl V5 with 
the initial segment in V5 of the same size. This initial segment is shown by the 
dark gray area in Fig.[ 3 l By Theorem [T| (applied for V5), the resulting set will be 
optimal too. This is true because the number of inner edges within A5 will not 
decrease, and the number of edges between A3 and Vi \ V3 remains the same. 
Let IA5I = (p — k3)k5 + 65 with 0 < ^5 < p — ks. 

Without loss of generality we can assume 62 = 0 . Indeed, otherwise if S3 > 
P—S2 then we move {x, ^3) G A3 for 52+(53 < x < P+S3 to (k2,x—S2+p) G Vi\A2. 
For the resulting set C one has 

|7(C)| - |7(A)1 = (p - 52)(52 - ((<52 + <53 - p) + fe2 - fcs)) > 0. 

If 0 < (53 < p — (52 then we move {x, fcs) G A3 for p < a: < p + ( 5 s to {k2,x + S2) G 
V2 \ A2. For the resulting set C one has 

|7(G)| - |7(A)1 = (5s(52 + k2- fcs) > 0. 

If (53 = 0 then A3 7^ 0 implies ^3 > 0 . We move (x, ^3 — 1 ) G A3 for p+<52 < x < 2 p 
to (fejx) G V2 \ A2. For the resulting set C one has 

|7(C)| - |7(A)1 = (p - 52){k2 - 73 + 1) > 0. 

In all cases we get a contradiction with the optimality of A. 

Case 2 a. Assume fcs > 0 . If fcs > k3 or k^ = k3 and ^3 = 0 , then |A| < 2 p^. 
We move (x, y) G A3 to {y + k2,x) G Vi \ A2. This results in a set C C Vi U Vi 
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with |/(C')| > |/(^)|, and we apply the arguments of case 1 to further transform 
Cto T‘^{\A\). 

If 1 < /c5 < ^3 or 1 < fcs = fca and ^3 > 0 , then we move {x,y) S A3 for 
p < X < 2 p and 0 < y < to {y + k2,x) € V2 \ A2 . For the resulting set C one 

has 

| 7 (C)| - \I(A)\ = ksp{k2 - (fc3 - fcs)) - Ssks > ks{p - S3) > 0 . 

Moreover, IC3I < |^3|, and C satisfies the assumptions of case 2 . Applying 
similar arguments to C we obtain a set D either with D3 = 0 or with k^ = 0 . In 
the first case we apply to D the arguments of case 1 and transform it to 1F^(|A|). 

Case 2 b. Assume k^ = 0 and k3 > 0 . We show that in this case A is not 
optimal. We start with interchanging (a;, 0 ) G A3 for p + S3 < x < 2 p and 
(x, ^3) G V3\ A3. This transformation does not affect the number of inner edges in 
a set. First assume ^2 < p— 1 . If > 0 we move {k2, y) G Ai for k3 < y <k3 + S3 



F2 



k2 




B 2 






5s 



Fi 




F2 



V4 






Fig. 4. Transformation of A into B and C in case 2b 



to (^2 + I, y). The resulting set B has the same number of inner edges. If Js = 0 
we set B = A. The set B is shown in the left part of Fig. Denote 

P = {(x, fca — e) G A I /C2 < X < 2p}, 

Q = {{k 2 ,y) ^ A \ k3 - e <y <2p + k3 - k 2 - e} 

where e = 0 if ^3 > 0 and e = 1 if ^3 = 0 . Now |P| = |Q| = 2 p — k2 and 
we replace P with Q (cf. Fig 3 ). We show that for the obtained set C one has 
|/(C)| > \I{B)\. Indeed, if ^3 > 0 then for A = |/(C)| — |/(P)| one has 



A = [|/(Q)| + |7(Q, A)|] - [|7(P)| + |7(P, A \ P)\] 



+ (2p — 72)^2 + 



p + ks — k2 
2 



+ (p + ks- k2){p - i) 
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(P - fc2)(fes + fe) + 



p-k2 
2 



+ P{P 




+ kiSi + {p — S 3 ){k 3 — 1) 



= pk 2 + (fcs - k 2 ){p -i) - pkz - p + (53 = i{k 2 - fcs) + (p - 63 ) > 0. 



Similarly, if (5a = 0 then A = i{k 2 — fca + 1) > 0. 

Now assume ^2 = P — 1- In this case we construct the set C by replacing P 
with the set i? = {(p — 1, y) ^ A | fca + ^5 — e < y < p + fca + (5s — e} with e 
defined above. For A = |/(C)| — \I{B)\ one has 



^ = [\i{R)\ + - [\m\ + \i(p,A\p)\] 

_ ~ (Is — fca — ^ + 1 + 5s^ 

+p(p — 1) + (p — (5s — fcs — l)(fe3 + 1 + (5s) + (fe + 1 + Ss){p — i) 




+ ksSs + (p — 53){k3 — 1) + p(p — i) 



= 2(p — (5s — fcs — l)(fe3 + 1 + (5s + i/2) + (p — S 3 ) +P(5s >p — 83 > 0. 



In all cases we get a contradiction with the optimality of A. 

Case 2c. We assume that ^5 = 0, fca = 0 and da > 0. Similar arguments to 
the case 2 b. can be used to show contradiction to the optimality in this case. 
We omit the proof because of space limitations. 

Case 3. Assume Vi C A. Then A = V \ A C V 2 LI V 3 U V 4 , and we can apply 
the arguments of case 1 and case 2 to A and transform it to J-‘^{\A\). Since, by 
Lemma [3l both A and A are optimal sets then the proof is completed. □ 

Due to the local-global principle for the lexicographic order [2], we have the 
following result: 



Corollary 2. Ifl<i<p — i, then |/(lF”(|A|))| > |/(A)| for any A C 1^” and 
n> 1 . 



The next theorem shows that the assumption concerning i is important for 
the existence of an optimal order. 

Theorem 3. There exist no optimal order for Hp^i x Hp i if p > S and p — i < 
i <p — 1 . 

We omit the proof of this Theorem due to space limitations. Note, that 
TheoremElis not true for i = p — 1. In this case Hp^i (resp. Hp J is isomorphic to 
Kp X K 2 (resp. Kp x Kp x K 2 x K 2 ) and the existence of an optimal order follows 
from Theorem [U Furthermore, Theorem Elis also not true for t = p. In this case 
Hp^i (resp. Hp J is non-connected and is isomorphic to two disjoint copies of Kp 
(resp. to four disjoint copies of Kp x Kp). In this case the lexicographic order is 
also optimal as it follows from [3j. 
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4 Concluding Remarks 

Results similar to Theorems |2] and can be obtained for the graph Bp^i = 
where 

Vb,,, =R'UR", R' = {0,2,...,2p-2}, R" = {1, 3, . . . , 2p - 1}, 

= {(m, v) \ u G V, V G V" , u + v=/= {2p — j) mod 2p, j = 1, . . . , i}. 

It is easily shown that for any m the set {0, 1, . . . , m — 1} is a solution to the 
EIP on Bp,i. With this numbering the lexicographic order onV = Vb i x Rb i 
is well-defined. 

Theorem 4. Let i < p/2 — i and let p > 4, he even. Then \I{T‘^{\A\))\>\I{A)\ 
for any A CV x V. 

The local-global principle [2] implies that a similar result holds for any carte- 
sian power of Bp^i. We also have some preliminary results concerning the EIP 
for the cartesian powers of Kp p {t parts) with up to two t-factors deleted. In 
these cases the lexicographic order is also optimal. 
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Abstract. We study the following Constrained Bipartite Edge Coloring 
( CBEC) problem: We are given a bipartite graph G{U, V, E) of maximum 
degree I with n vertices, in which some of the edges have been legally 
colored with c colors. We wish to complete the coloring of the edges of G 
minimizing the total number of colors used. The problem has been proved 
to be NP-hard even for bipartite graphs of maximum degree three [5]. 
In previous work Caragiannis et al. [2] consider two special cases of the 
problem and proved tight bounds on the optimal number of colors by 
decomposing the bipartite graph into matchings which are colored into 
pairs using detailed potential and averaging arguments. Their techniques 
lead to 3/2-aproximation algorithms for both problems. In this paper 
we present a randomized (1.37 + o(l))-approximation algorithm for the 
general problem in the case where max{Z,c} = cu(lnn). Our techniques 
are motivated by recent work of Kumar El on the Circular Arc Coloring 
problem and are essentially different and simpler than those presented 
in [2]. 



1 Introduction 

Konig’s classical result from graph theory [TO], states that the edges of a bipartite 
graph with maximum degree I can be colored using exactly I colors so that 
edges that share an endpoint are assigned different colors (see also [T]). We call 
such edge colorings legal colorings. Konig’s proof [TO] is constructive, yielding a 
polynomial-time algorithm for finding optimal bipartite edge colorings. Faster 

* This work was partially funded by the European Communities under 1ST FET 
Project ALCOM-FT and RTN Project ARACNE. 
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algorithms have been presented in | 3|6 |7|15J. These algorithms usually use as a 
subroutine an algorithm that finds perfect matchings in bipartite graphs [UTl]. 

Bipartite edge coloring can be used to model scheduling problems such as 
timetabling. An instance of timetabling consists of a set of teachers, a set of 
classes, and a list of pairs (t, c) indicating that the teacher t has to teach class c 
during a time slot within the time span of the schedule [15]. A timetable is an 
assignment of the pairs to time slots in such a way that no teacher t and no class 
c occurs in two pairs that are assigned to the same time slot. This problem can 
be modelled as an edge coloring problem on a bipartite graph. 

In real-life situations, the problem is made somehow harder due to addi- 
tional constraints that are imposed on the solutions. This is a general feature 
of practical optimization problems and it is due to the fact that an optimiza- 
tion problem at hand is most of the time just a subproblem of a larger-scale 
optimazition that one seeks to obtain. In the example of scheduling classes and 
teachers, it is sometimes the case that some teachers have been assigned to a 
timeslot because of some other duties that they have to attend during other time 
slots; thus, assignments will have to take into account this extra restriction. So, 
usually, additional constraints that are put on a timetable make the problem 
NP-complete |3]. 

Caragiannis et al. in [2] study two special cases of problem CBEC that arise 
from algorithmic problems in optical networks (see jI2lt)j l. Their results can be 
summarized as follows: 



— Problem A: Some of the edges adjacent to a specific pair of opposite vertices 

of an /-regular bipartite graph are already colored with S colors that appear 
only on one edge {single colors) and D colors that appear on two edges 
{double colors). They show that the rest of the edges can be colored using at 
most max{min{/ -|- D, ^}, I + } total colors. They also show that this 

bound is tight by constructing instances in which max{min{/ -|- D, y},/ -I- 

colors are indeed necessary. 

— Problem B: Some of the edges of an /-regular bipartite graph are already 
colored with S colors that appear only on one edge. They show that the 
rest of the edges can be colored using at most max{/ -|- S/2, S} total colors. 
They also show that this bound is tight by constructing instances in which 
max{/ -I- S/2, S'} total colors are necessary. 



Their techniques are based on the decomposition of the bipartite graph into 
matchings which are colored into pairs using detailed potential and averaging 
arguments. Their results imply 3/2-aproximation algorithms for both problems. 

The original proofs in [2] consider /-regular bipartite graphs G{U, V, E) with 
\U\ = \V\ = n/2. However, these results extend to bipartite graphs of maxi- 
mum degree / with n vertices using a simple observation presented in Section |2] 
Note that CBEC has been proved to be NP-hard even for bipartite graphs of 
maximum degree three |5]. 
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Our approach. In this paper, motivated by recent work of Kumar EH on the 
circular arc coloring problem, the steps we follow to obtain a provably good 
approximation to problem CBEC are summarized below: 

— Given a bipartite graph of maximum degree I in which some of the edges 
are legally colored with c colors, we reduce the problem to an integral mul- 
ticommodity flow problem with constraints. 

— We formulate the multicommodity flow problem as a 0-1 integer linear pro- 
gram. 

— We relax the integrality constraint, and solve the linear programming relax- 
ation obtaining an optimal fractional solution. 

— We use randomized rounding to obtain a provably good integer solution of 
the integral multicommodity flow problem which corresponds to a partial 
edge coloring. 

— We extend the edge coloring by assigning extra colors to uncolored edges. 

In this way we extend the coloring of the edges of G using a total number of 
colors which is provably close to the optimal one. Our algorithm is randomized 
and works with high probability provided that the optimal number of colors is 
large (i.e., a;(logn)). 

Roadmap. The rest of the paper is structured as follows. We present the reduc- 
tion from the costrained bipartite edge coloring problem to an integral multi- 
commodity flow problem in Section E] In Section El we demostrate how to ap- 
proximate the solution of the integral multicommodity flow problem and prove 
that this solution corresponds to an approximate edge coloring. An improvement 
to our approach is presented in Section S] 

2 Bipartite Edge Coloring and Multicommodity Flows 

In this section we describe the reduction of an instance of problem CBEC to an 
instance of an integral multicommodity flow problem with constraints. We first 
present a reduction of the initial instance of the CBEC problem to the following 
one. 

Let G = (U, V, E) be a bipartite graph with n = rii + ri 2 vertices, with 
U = {ui, ...,M„j}, V = {ui, ...,Vn 2 }, and with maximum degree I, in which some 
of the edges in E are already legally colored. For any integer k > 0, we construct 
the bipartite graph Gk = {A, B, E{Gk)) where the sets of vertices A and B are 
defined as 

A = {x,\u, e [/} U {y[\vi € V}, 

and 

B = {yi\vi G E} U {x'i\ui G U}. 

For graph Go, the set of edges E{Go) is defined as follows. For any edge {ui, vj) G 
E{G) with Ui £ U and Vj G V, E{Gq) contains two edges: {xi,yj) and (a;',y'). 
We call these edges regular edges. Also, let I be the maximum degree of G and 
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let d{ui) (resp. d{vi)) be the degree of a vertex m £ U (resp. Vi £ V) in G. The 
edge set E(Gq) also contains I — d{ui) copies of for i = and 

I — d{vi) copies of {yi,y'i) for i = These edges are called cross edges. 

Graph Gk for fc > 0 is obtained from Go, by adding k copies of the edges {xi,x'^ 
for i = I, and k copies of the edges (yi,y[), for i = 1, ...,n 2 - An example 

for the construction of graph Gk from G is depicted in Figure |T] 




Lemma 1. G can be edge colored with I + k colors iff Gk can be edge colored 
with I + k colors. 

Proof. Since G is a subgraph of Gk, any legal edge coloring of Gk trivially yields 
a legal edge coloring of G. 

Assume that we have a legal edge coloring of G with I + k colors. Then, the 
edges of Gk can be colored with I + k colors as follows. For any edge (ui,Vj) G 
E{G) colored with a color y, we color the edges (xi,yj) and {x\,yl) of E{Gk) 
with y. This gives a partial edge coloring of Gk in which the cross edges are 
uncolored. Let ut £ U (resp. Vi £ V) and let d{ui) (resp. d{vi)) be the degree of 
Ui (resp. Vi) in G. The cross edges between Xi and x) (resp. between yi and y') are 
now constrained by d{ui) (resp. d{vi)) colors. Thus, we can use the l + k — d{ui) 
(resp. I + k — d{vi)) colors not used by edges adjacent to Ui (resp. Vi) to color 
the cross edges between Xi and a;' (resp. yt and y[). This completes the coloring 
of Gk with I + k colors. □ 
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Now, for any integer k > 0, consider the multicommodity network = 
{W, A, B, Z, E{Hi.)) constructed as follows. Sets of vertices A and B are the 
same with those of graph Gk ■ Also, 

W = {wi, ...,wi+k} 

and 

Z = {zi,...,zi+k}- 

The set E{Hk) is defined as 

E{Hk) = E{Gk) 

U{(uii, Xj)\l<i<l + kA^j < til} 

U{(tCi, j/')|l < i < ^ + fc, 1 < j < 112 } 

U{(i/j,Zi)|l < t < / + A:, 1 < j < 712 } 

U{(x' , Zj)|l < Z < Z + /C, 1 < j < Til} 

All the edges in E(Hk) have unit capacity, and an edge can carry only an 
integral amount of flow for each commodity. The source for the i-th commodity 
is located at Wi, while the corresponding sink is located at Zi. An example for 
the construction of network Elk from graph Gk is depicted in Figure [1] 

Intuitively, an integral flow of the l+k commodities corresponds to a (partial) 
legal coloring of the edges of Gk- an edge between A and B carrying one unit of 
flow for commodity i in Hk corresponds to an edge colored with color i in Gk- 
Since some of the the edges of the graph Gk are precolored, our multicom- 
modity flow problem has some additional constraints. If an edge is precolored 
with color i in Gk, it is constrained to carry a unit amount of flow for commodity 
i in Elk- So, we can reduce an instance of CBEC to multicommodity flow with 
constraints using the following observation. 

Lemma 2. Gk can be edge colored with I -E k colors iff there is an integral 
(constrained) flow of value n(l + k) for commodities 1, ..., I + k in network Hk- 

In the next section we show how to approximate the corresponding integral 
constrained multicommodity flow problem, and, using the reduction above, we 
obtain a provably good solution for the initial instance of CBEC. 

3 Approximating the Multicommodity Flow Problem 

In general, integral multicommodity flow (without constraints) is NP-complete 
P]. However, it is straightforward to formulate the constrained multicommodity 
flow problem as a 0-1 integer linear program and solve its linear programming 
relaxation by setting aside the integrality constraint. In this way, we obtain an 
optimal fractional solution. 

Clearly, max{Z, c| is a lower bound on the minimum number of colors suf- 
ficient for extending the partial edge-coloring of G- We begin with network 
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^^max{i,c}-;j solving the corresponding linear program LP„iax{i,c}-o If the max- 
imum flow is smaller than nmax{Z,c}, this means that the integer linear pro- 
gram has no flow with value nmax{^, c}, meaning (by Lemma[^ that there exist 
no coloring of Gmaxti.c}-/ with max{^, c} colors. We continue with networks 
^^max{z,c}-i+i) ^I^max{i,c}-/-i- 2 j •■•) Until we And some L such that the solution of 
LPl-; gives a fractional (constrained) multicommodity flow of value nL. Clearly, 
L is a, lower bound for the minimum number of colors sufficient for coloring the 
edges of Gl-i- 

Now, we will use the fractional solution of the linear program LPi_; in 
order to obtain a solution for the corresponding integer linear program ILPi_; 
which is provably close to the optimal one. We will use the randomized rounding 
technique proposed by Raghavan HU. 

Let / be the flow obtained by solving LPi_;. Flow / can be decomposed into 
L flows /i, / 2 , /l; one for each commodity. Each fi can be further broken up 

into sets Piq, Pi^ 2 , •••, of n vertex-disjoint paths from Wi to Zi (i.e., the edges 
between A and B in each set of vertex-disjoint paths forms a perfect matching) 
each carrying an amount rriij of flow for commodity i, such that '^i,j — 

fi = l. We call the procedure of decomposing flow matching stripping (since it 
is similar in spirit to the path stripping technique proposed in m)- 

Lemma 3. Matching stripping can be done in polynomial time. 

Proof. Matching stripping can be performed as follows. Consider a solution to 
LPfe and the associated flows for commodity i in network Hk. Set j = 1. Let Cj 
be the edge carrying the smallest non-zero amount mij of flow for commodity 
i. Find a set Pi j of n vertex-disjoint paths from Wi to Zi containing edges that 
carry non-zero amount of flow for commodity i including Cj. Associate amount 
rriij with Pij and subtract amount niij from the flow for commodity i carried 
by each edge in Pij. Repeat this process for j = 2, 3, ..., until no flow remains. 
This will decompose the flow fi into sets of n vertex-disjoint paths Pij between 
Wi and Zi each carrying amount mij of flow for commodity i. 

We first inductively prove that a set of n vertex-disjoint paths Pij from 
Wi to Zi can be found at any execution of the above process. Let Ci be the 
edge carrying the smallest non-zero amount of flow for commodity i in the 
beginning of the first execution. Assume that any set of vertex-disjoint paths 
from Wi to Zi containing edges that carry non-zero amount of flow for commodity 
i including ej has size at most n— 1. This means that there is no perfect matching 
containing e\ in the subgraph of Plk containing the vertex sets A and B and the 
edges between them that carry non-zero amount of flow for commodity i. By 
Hall’s Matching Theorem (see II]), we obtain that there exists a set S' C A (such 
that Cl is incident to one of its vertices) with neighborhood N{S) C R of size 
|fV(S)| < |S| — 1. Observe that since the solution of LP/^ is optimal the edges 
incident to a vertex of A carry unit total amount of flow for commodity i. Thus, 
the edges incident to S carry a total amount |S| of flow for commodity i and, 
since |A^(S)| < |S| — 1, the capacity constraints for some of the edges incident 
to N{S) are violated. Thus, a perfect matching Mi^i containing edges between 
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A and B including Ci exists. The set of vertex-disjoint paths is constructed 
by adding all edges between Wi and A and all edges between B and Zj to M^^i. 

Assume now that j — 1 sets of n vertex disjoint paths Bi^i, Pi^ 2 , •••, Pi,j-i 
between Wi and Zi have been constructed in the beginning of the j-th execution 
of the above process and let mi_ 2 , •••, "rriij-i be the associated flows for 

commodity i. Furthermore, assume that there still exists an edge which carries 
non-zero amount of flow for commodity i. Note that an amount of 
of flow for commodity i has been subtracted from each edge between Wi and A, 
from each edge between B and Zi, from the edges between A and B incident to 
each vertex of A, and, similarly, from the edges between A and B incident to 
each vertex of B. Following the same reasoning as above, we consider the edge 
tj carrying the smallest non-zero amount of flow and we obtain that there exists 
a perfect matching Mi j between A and B containing edges that carry non-zero 
amount of flow for commodity i including tj (otherwise, some the edge capacity 
contraints in the original solution of LP^ would have been violated). Again, the 
set Pij of vertex-disjoint paths is constructed by adding all edges between Wi 
and A and all edges between B and Zi to Mij. 

We now easily prove that the number ti of executions of the above process 
is polynomial. Observe that after the j-th execution, there exists at least one 
edge (cj) which carry zero amount of flow, and, thus, it will not be considered 
in the construction of paths Pi^t for t > j. Thus, the number of executions of 
the process is at most the number of edges between A and B, i.e., ti < n{l + k). 

The lemma follows since maximum bipartite matching can be solved in poly- 
nomial time. □ 

In order to obtain an integer solution for ILPi_;, for each commodity i, 
we will select one out of the ti sets of vertex-disjoint paths, and use its edges 
to route commodity i. To select a set of vertex-disjoint paths for commodity 
i, we cast a t^-faced die (one face per each of the ti sets of vertex-disjoint 
paths) where niij are the probabilities associated with the faces. The selection 
is performed independently for each commodity. Performing this procedure for 
each commodity, we obtain L sets of n vertex-disjoint paths to route the L 
commodities. 

However, these sets of n vertex-disjoint paths may not constitute a feasible 
integer solution to WjP l-i since some edge capacities may be violated. Since 
in the fractional solution an edge between A and B may carry more than one 
commodity, it is possible that, during the rounding procedure, more than one 
commodities may select sets of vertex-disjoint paths that contain that edge. 

Next, in each edge between A and B that was selected by more than one 
commodities we arbitrarily select one commodity that will use this edge. In this 
way, we obtain a feasible integer solution for ILPi_;. 

Note that the feasible solution of the integral multicommodity flow problem 
in Hl-i corresponds to a partial edge coloring of Gl-i with L colors. We also 
have to assign extra colors to the edges that do not belong to the sets of vertex- 
disjoint paths that were selected by the rounding procedure. Let be the 

(random) subgraph of Gl-i that contains all vertices of Gl-i and the edges 
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that do not correspond to edges of that were selected by the rounding 

procedure. 

Next, in Lemma we will provide an upper bound on the maximum degree 
of graph Our proof is based on the following technical lemma on a well- 

known occupancy problem. A proof can be found in Kumar El (see also El). 

Lemma 4. Consider the process of randomly throwing mi balls into m 2 bins 
such that the expectation of the number of balls thrown into any bin is at most 
one. For the random variable Z denoting the number of empty bins, it holds that 

Pr[Z > m 2 — TOi + mi/e + X^/ml] < 2 exp(— A^/2). 

Lemma 5. The maximum degree of Gf_i is at most L/e + 2-\/L Inn, with prob- 
ability at least 1 — 4/n. 

Proof. During randomized rounding, each commodity i randomly selects a set 
of n vertex-disjoint paths between the source Wi and destination Zi in H^-i. 
Thus, for each node u of Gl-i, one of the L edges incident to u is selected to 
carry unit flow for a specific commodity. Intuitively, we can think of the integral 
flow for each commodity as a ball and the edges incident to a vertex u as bins. 
The randomized rounding procedure can be modelled by the classical occupancy 
problem where L balls are to be randomly and independently thrown into L bins 
with the restriction that the expectation of the number of balls thrown into any 
bin is at most one (this is due to the edge capacity constraints of the flow 
problem). Using Lemma 01 with m\ = m 2 = L and A = 2-\/lnn, we obtain that 
the random variable denoting the number of empty bins, i.e., the number of 
edges incident to u which are not selected for carrying flow for any commodity, 
is at most LjeF 2\jL Inn with probability at least 1 — 2/n^. 

Thus, the probability that more than L/e+2\/L Inn edges incident to some of 
the 2n vertices of Gfe have not been selected after the execution of the randomized 
rounding procedure is at most 2n • 2/n^ = 4/n. The lemma follows. □ 

By Lemma[5l the edges of Gf_i can be colored with at most L/e + 2 \/ L In n 
extra colors, with high probability. Thus, in the case where L is large (i.e., 
L = w(lnn)), we have proved the following theorem. 

Theorem 1. With very high probability, the algorithm uses at most (l + l/e)L+ 
o{L) total colors. 

Since Lis a, lower bound to the optimal number of colors sufficient for coloring 
the edges of the bipartite graph, we obtain that our algorithm has approximation 
ratio 1 + 1/e + o(l) = 1.37 + o(l). 

4 Decreasing the Number of Colors 

In this section we discuss some modifications of our algorithm which lead to a 
better upper bound on the total number of colors sufficient for solving instances 
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of CBEC. Note that this improved result does not imply an approximation ratio 
better than the one obtained in Section 

We slightly modify the reduction described in Section Consider again the 
bipartite graphs Gk = {A, B, E{Gk)) (for integer fc > 0) defined in Section|^ For 
any integer k > max{^, c} — I, we construct the multicommodity flow network 
i/' = {W', A, B, Z', E{H'^)) where now 

W = rCmax{Z,c}} 

and 

^ {■^1 ; ■ -^max{Z,c} }■ 

The set E{H'j.) is defined as 

F(iJ' ) = F(Gfc) 

U{(wi,Xj)|l <i< max{Z,c}, 1 < j < ni} 

U{(wj,y')|l < i < max{Z,c},l <j< 712 } 

VJ{{yj,Zi)\l < i < max{Z,c}, 1 < j < 712 } 

U{{x'j,Zi)\l < i < max{/, c}, 1 < j < n^} 

Our reduction is now based on the following lemma. 



Lemma 6. Gk can he edge colored with I + k colors iff there is an integral 
(constrained) flow of value nmax{Z,c} for commodities in network iJ(,. 

Proof. A coloring of Gk with I + k colors can be reduced to an integral (con- 
strained) flow of value n max{l, c} for commodities in network H). by making each 
edge between A and B colored with some color i in Gk (for 1 < i < max{^, c}) 
carry a unit amount of flow for commodity i. 

Given an integral (constrained) flow of value nmax{Z,c} for commodities in 
network El'^,, we can achieve a partial coloring of Gk with max{^, c} colors by 
using color i (for 1 < f < max{Z, c}) to color an edge which carries a unit amount 
of flow for commodity i. We observe that the vertex-induced subgraph of Gk 
which contains the edges of Gk left uncolored is (^-l-A: — max{/, c})-regular. Thus, 
l + k — max{Z, c} colors can be used to complete the coloring of the edges of Gk 
with I + k colors in total. □ 



The general structure of our approach is the same with the one described 
in Section [HI We begin with network c}-;> solving the corresponding lin- 

ear program LPniax{i,c}-/- If the maximum flow is smaller than nmax{^, c}, this 
means that the integer linear program has no flow with value nmax{Z, c}, mean- 
ing (by Lemma O that there exist no coloring of Gmax{z,c}-z with max{Z,c} 
colors. We continue with networks I?(nax{/.c}-z-Hi> ■f^max{z.c}-z+ 2 ^ -- until we find 
some L such that the solution of LPl_; gives a fractional (constrained) multi- 
commodity flow of value nmax{/,c}. By Lemma El L is a lower bound for the 
minimum number of colors sufficient for coloring the edges of Gl-i- 
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Then, we use the fractional solution of LP/,_/ to obtain a feasible solution of 
ILP/,_; using randomized rounding. Again, we can prove that matching stripping 
can be correctly performed in polynomial time; however, some minor modifica- 
tions are needed in the proof of Lemma 

In order to obtain an upper bound on the degree of the graph G"^_; (the 
subgraph of Gl-i containing edges of Gl-i left uncolored after the application 
of the rounding procedure), we again use Lemma 0 (with mi = max{Z,c} and 
TO2 = L) to show that G'^_; can be edge colored with L— max{/, c}-|-max{Z, c}/e-|- 
o(max{Z, c}) additional colors. In this way, when max{/, c} = w(lnn), we obtain 
the following. 

Theorem 2. With very high probability, the algorithm uses at most 7^_|_ _|_ 

o(max{Z, c}) total eolors. 
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Abstract. A maximum clique transversal set in a graph G is a set S of 
vertices such that every maximum clique of G contains at least a vertex 
in S. Clearly, removing a maximum clique transversal set reduces the 
clique number of a graph. We study algorithmic aspects of the problem, 
given a graph, to Hud a maximum clique transversal set of minimum 
cardinality. We consider the problem for planar graphs and present fixed 
parameter and approximation results. 

We also examine some other graph classes: subclasses of chordal graphs 
such as k-trees, strongly chordal graphs, etc., graphs with few P 4 S, com- 
parability graphs, and distance hereditary graphs. 



1 Introduction 

A maximum clique transversal set in a graph G is a set S of vertices such 
that every maximum clique of G contains at least one vertex in S. The clique 
transversal set is the same, but with “maximum clique” replaced by ’’maximal 
clique” mm- The maximum clique transversal set problem is, given a graph 
G, to find a maximum clique transversal set of minimum cardinality of G. 

In this paper, we study algorithmic aspects of the maximum clique transversal 
set problem. One of the main objectives for this research is the placement of 
transmitters for mobile telephones. One common method (mostly) used to place 
transmitter for mobile telephones is to place them at the corner points of a 
hexagonal grid that is placed over the area of the country. 

A common solution is the placement of another transmitter close to it when a 
transmitter can no longer handle all communication requirements. The drawback 
is that this new transmitter needs another frequency to avoid interference. At 
some instance the number of these transmitters that are placed close to each 
other grows so large, that it is desirable to replace them by a more efficient (but 

* Most of this work was done while this author visited Chung Cheng University. During 
the initial part of the research for this project this author was supported by an 
Earmarked Research Grant from the Research Grants Council of Hong Kong. 
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also much more expensive) big transmitter tower. These towers are quite large, 
and contain also very costly hardware to switch between different frequencies 
between transmitters contained in it in some optimal way. 

The problem stated as a graph algorithmic problem is to find the minimum 
number of vertices (transmitters) such that every maximum clique contains at 
least one element of this set. We emphasize on planar graphs since for this class 
of graphs the practical applications seem of most importance. Unfortunately, 
the problem remains NP-complete for planar graphs in general, but it turns out 
to be fixed parameter tractable. We present fixed parameter results for planar 
graphs in this paper and show that the optimal solution can be approximated 
within an arbitrary constant. 

For theoretical reasons we also investigate some other common graph classes. 
We consider subclasses of chordal graphs such as k-trees, strongly chordal graphs 
etc., graphs with few P4’s, comparability graphs, and distance hereditary graphs. 
It seems that the problem remains NP-complete in a class of graphs in which 
the clique transversal problem is NP-complete, and is polynomially solvable in a 
class of graphs in which the clique transversal problem is polynomially solvable. 

2 Preliminaries 

Let G be a graph. The maximum clique number of G is denoted by lo{G) and 
the domination number of G is denoted by 7(G). For a subset S of vertices we 
let G — S' be the graph induced by V \ S. For every a: € U, we use S — x and 
S -I- a; to represent S — {a;} and S U {a;}. 

Definition 1. A maximum clique transversal set in a graph G = (V, E) is a 
subset S QV of vertices such that every maximum clique of G contains at least 
a vertex in S. 

The maximum clique transversal problem is the problem of finding a maxi- 
mum clique transversal set in G of minimum cardinality. We write h{G) for the 
minimum size of a maximum clique transversal set in graph G. 

Definition 2. Let G be a graph and consider the union of all maximum cliques 
in G. Call the subgraph induced by this union the subgraph induced by all maxi- 
mum cliques, denoted by G* . 

We start with a very basic observation. 

Lemma 1. There is a maximium clique transversal set of cardinality k in graph 
G if and only if it is a clique transversal set in G* . 

The following lemma is also quite obvious. Notice that every maximum clique 
transversal set in a graph G is a dominating set of G*, since every vertex of G* 
is contained in at least one maximum clique. 

Lemma 2. For any graph G, q(G*) < h{G). 
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3 Fixed Parameter Results for Planar Graphs 

We here use two widely-applicable techniques, search-tree method and reduction 
to a problem kernel to achieve our fixed parameter results for planar graphs. 
Before concerning them first notice that the maximum clique transversal problem 
remains NP-complete when restricted to planar graphs: 

Lemma 3. The maximum cique-transversal problem is NP-complete in planar 
graphs. 

Proof. This follows from the NP-completeness of the vertex cover problem when 
restricted to triangle-free planar graphs. Notice that the independent set problem 
remains NP-complete when restricted to triangle-free, 3-connected, cubic planar 
graphs HZ]. □ 

Recall the definition of a layer-decomposition Li, . . . , of a planar graph 
G, where Li is the outerface and t an upperbound for the outerplanarity of G. 

For further definitions and notions on special graph classes not defined here, 
we refer to [H]. 

Remark 1. Notice that the number of 4-cliques in a planar graph is linear: Every 
4-clique in a plane embedding has a vertex “in the middle” which is unique for 
this 4-clique. It can also be proved that the number of triangles in a planar graph 
is linear. The clique number of a planar graph is less than 5. Hence the number 
of all maximum cliques in a planar graph is linear. 

3.1 Search- Tree Method 

From |18| . the clique number of a planar graph can be found in linear time and 
we can have the following lemma by I10I15I18I . 

Lemma 4. There exists a linear time algorithm to list all maximum cliques of 
a planar graph. 

Following the above lemma, all maximum cliques of a planar graph can be 
generated in linear time. We now construct a search tree as follows. Consider 
all maximum cliques of G in arbitrary order Gi,...,G^. Label the root R of 
search tree with the empty set and then build the search tree that consists of R 
with children which are the vertices of an arbitrarily selected maximum clique 
Gi, 1 < i < £. Each child v gets children that correspond with the vertices of 
another maximum clique G that does not contain v. In general, each node v of 
the search tree gets children that correspond with the vertices of a maximum 
clique G that does not contain any vertex in the path from node v to the root 
of the search tree. 

Notice that for a maximum clique transversal set of size at most k, we have 
to consider only paths of this search tree from the root with at most k vertices. 

Since each node has degree at most w(G), the total number of these paths 
is bounded by uj^ where uj = uj{G). We search the tree by depth-first traversal 
to determine for each path if it indeed hits every maximum clique in 0(n) time. 
Hence the total time needed for our search tree algorithm is 0{uj^n). 
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Theorem 1. For each constant k there exists an algorithm which runs in time 
0{uj^n) to determine if there is a maximum clique transversal set in a given 
planar graph G with n vertices and io{G) = uj. 

For more weighty k the following algorithm can be beneficial. Recall the 
result of H]: 

Theorem 2. Let G be a planar graph and let k be a constant. There exists an 
algorithm which runs in time 0(c^n) to find a dominating set in G if it exists. 
Here c is a moderate constant. 

Our algorithm computes an optimal maximum clique transversal set for pla- 
nar graphs in linear time works as follows. Since 7 (G) < h{G) < k for a constant 
k, we know that the treewidth is at most 0{Vk) (see m- Furthermore, we can 
construct an approximate tree-decomposition of width at most 0{Vk) in time 

0{Vkn). 

We just use the customary way for finding a solution using dynamic programming 
on the tree-decomposition. 

Due to space limitations, we confine to mentioning the final result. 

Theorem 3. For every fixed constant k there exists a linear time algorithm to 
compute a maximum clique transversal set of size at most k (if it exists) in time 
0{c'^n). 



3.2 Reduction to a Problem Kernel 

In this subsection we show that we can either reduce the graph to a subgraph 
with O(fc^) vertices, or decide that no maximum clique transversal set can exist 
of order at most k. The original graph has a maximum clique transversal set of 
order at most k if and only if the kernel has a maximum clique transversal set of 
a small enough order. Combining this result with the algorithm of the previous 
section we obtain an algorithm that runs in time 0{u!^k^ + n) (or, alternatively, 
0{c'^k‘^ -|- n) ) . For simplicity we assume that in the following cases the clique 
number of a planar graph is 4. It is easy to see how to obtain corresponding 
solutions for the cases with smaller clique number. 

Notice that we may assume that every vertex of the graph G is contained in 
at least one 4-clique, since otherwise we could remove it from the graph together 
with all it’s incident edges. If some edge is not contained in a 4-clique we can 
remove the edge (without the two endpoints) from the graph. These changes 
can be performed in linear time (using the list of 4-cliques) and do not change 
the problem. Henceforth we assume that every vertex and every edge of G is 
contained in at least one 4-clique. 

Lemma 5. If there exists a vertex of degree larger than 3(fc-|- 1) then it must be 
contained in any maximum clique transversal set of size at most k. 
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Proof. Let a; be a vertex of degree larger than 3(fc + 1). Assume the degree of x 
is at least 3fc + 4. 

Recall that each edge is contained in at least one 4-clique. 

Since the graph is planar, at least fc -|- 1 4-cliques have only x in common and 
do not share any edge from each other. If we do not pick x as an element in the 
maximum clique transversal set, the size of any maximum clique transversal set 
is at least fc-l- 1. Hence, x must be in the maximum clique transversal set of size 
at most k. □ 

Remove all vertices of degree larger than 3 ( A: -I- 1) in G and lower the desired 
maximum clique transversal set cardinality by one for every removed vertex. 
Henceforth we assume that the degree of all vertices is at most 3(fc-|-l). Consider 
this kernel G* = {V*,E*). Let n* be the number of vertices in this kernel. 

Lemma 6. n* = |R*| = 0{k'^). 

Proof. Let c be the number of 4-cliques in G*. Since every vertex is contained 
in a 4-clique c > ^ . We claim there are at least vertex disjoint 4-cliques in 
G*. This is justly to prove by induction on c. 

Hence, c < 12A:^ and n* < 4c = O(fc^). □ 



Theorem 4. There exists an algorithm that runs in time 0{4^k^ + n) to deter- 
mine if a planar graph has a maximum clique transversal set of size at most k. 
Alternatively, there exists an algorithm that runs in time 0{c^k‘^ -|-n) for some 
constant c. 

4 Approximations for Planar Graphs 

In this section we show that we can approximate the size of a maximum clique 
transversal set for planar graph arbitrarily close. 

Recall that if a graph is d-outerplanar, the treewidth is at most 3d — 1, and for 
graphs with bounded treewidth the maximum clique transversal problem can 
easily be solved in linear time. 

We divide all nodes into k classes; each class corresponds to level congruent 
to i (mod fc) for z = 0, 1, . . . , A; — 1. For each i, let Gj be a (fc -I- l)-outerplanar 
graph induced by the levels jk -|- z to (j -I- l)fc + z , j > 0. For every Gj, find 
its optimal maximum clique transversal set by using tree-decomposition. Let Pit 
denote the union of these solutions for class i. Among Hq, Pli , . . . , pick 

the minimum size of them as our approximation solution. By using the similar 
analysis to Baker j4], we have the approximation at most optimal as follows. 

Consider any optimal solution H . For some s,0<s<fc— 1, at most 
nodes in P[ are in levels congruent to s (mod fc). For V) be the set of nodes 
in H in levels jk s through (j -|- l)fc -|- s, and V' the optimal solution for 
the subgraph induced by these levels. Obviously, \V-\ < \Vj\ for each j. Since 
only the nodes in levels congruent to s (mod fc) are computed twice, the sum 
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over j of the |V}|’s is at most However, according to our algorithm, our 

approximation solution is not larger than the sum of the |Vj'|’s. Hence for fixed 
k, we have a linear time algorithm to find a maximum clique transversal set of 
size at most optimal. We obtain: 

Theorem 5. For every constant e there exists an efficient algorithm which finds 
a maximum clique transversal set of size at most h{l + e) where h is the optimal 
maximum clique transversal set size of G. 

Proof Fix k such that ^ < e. h^^^ = h{l + ^) < h(l + e). □ 

5 Chordal Graphs of Diameter Two and k- Trees 

Consider an instance of the ordinary parameterised hitting set problem. Let S 
be a finite set and let C be a collection of subsets of S. Let k be the parameter of 
the problem. Then the (ordinary) hitting set problem asks for a subset S' Q S 
with 1 5" I < k such that S' contains at least one element of each subset of C. 

We can reduce this instance to the problem of finding a maximum clique 
transversal set as follows. Take as a central clique the clique with vertex set S, 
and define for each element c G C a clique Q{c) of cardinality w = [S'! + 1 with 
Q{c) n S = c and with unique extra vertices added to each Q{c) in order to 
obtain the cardinality u for each Q(c). Hence, the size of the maximum clique 
transversal set for this graph is at most k if and only if the original parameterised 
hitting set problem has a yes-instance for parameter at most k. 

The parameterised hitting set problem was proven to be VF[2]-complete by 
Wareham, as reported (personal communication) in the book on parameterised 
complexity of Downey and Fellows. 

Theorem 6. The parameterised hitting set problem is W[2]-complete for chordal 
graphs. 

k- Trees. We show that the maximum clique transversal problem for maxi- 
mum cliques remains NP-complete for k-trees with k unbounded. Due to space 
limitations we mention the following without proof. 

Lemma 7. For k-trees the maximum clique transversal number h{G) is equal 
to the domination number ^{G). 

Proof If G is a A: -I- 1-clique then h{G) = 7 (G). Otherwise, consider a simplicial 
vertex x adjacent to a /c-clique uj. Consider a dominating set D. If x € D we 
can replace x by another vertex from lo and obtain a dominating set of the same 
cardinality. By induction every maximal clique in G — a; contains at least one 
element of D. Hence D is a maximum clique transversal set. 

Now consider a maximum clique transversal set TL of minimum cardinality. 
If X £ TL replace it by another vertex of oj and thus obtain a maximum clique 
transversal set TL* of the same cardinality. By induction, TL* has a vertex in 
every maximum clique of G — cc. Hence also in G. 




38 Maw-Shang Chang, Ton Kicks, and Chuan-Min Lee 

Theorem 7. The hitting set problem remains NP-eomplete for k-trees with k 
unbounded. 

Proof. The domination problem is NP-complete for fc-trees (arbitrary k) [1 1 1 . 

□ 

Since G* of a strongly chordal graph G can be computed in linear time and 
G* is also a strongly chordal graph, following Lemma[Tl the linear time algorithm 
for the clique transversal problem in strongly chordal graphs can be used to find 
a minimum hitting set in G by finding a minimum clique transversal set in G* . 



6 Cographs and Graphs with Few P 4 S 

An algorithm for the clique transversal problem for cographs was already de- 
scribed in [Ill- 

Cographs are properly contained in the set of distance hereditary graphs for 
which we describe an efficient algorithm in the next section. In this section we 
briefly give the formulas for cographs, and the extension to graphs with few P4S, 
i.e., {q,q — 4)-graphs with fixed parameter q. Let G be a cograph which is the 
join of two cographs Gi and Gi (i.e., every vertex of Gi is made adjacent to 
every vertex of G2). Let hi = h{Gi) be the maximum clique transversal numbers 
for Gi {i = 1, 2). Then h{G) = min(/ii, /12). 

Let G be a cograph which is the disjoint union of two cographs Gi and G2 
(i.e., no vertex of Gi is adjacent to any vertex of G2). Let h\ be the maximum 
clique transversal number for Gi and h.2 the maximum clique transversal number 
for G2. If the clique number of Gi is larger than the clique number for Ga_i then 
h = h(G) = hi. If both clique numbers are the same: h = h\ + h2. 

Corollary 1. The maximum clique transversal set problem can be solved in lin- 
ear time for cographs. 



Graphs with Few P4S. Recall that Jamison and Olariu extended the concept 
of cographs to those of graphs with few P4S. They call a graph p-connected if 
for every partition of the vertex set into two non empty sets Vi and V2 some 
induced P4 has elements on both sides of the partition. A p-connected graph is 
called separable if there is a partition of its vertex set such that every crossing 
P4 has its midpoints in Vi and its endpoints in V2- Jamison and Olariu obtained 
an unique tree structure theorem. 

Definition 3. A graph is called a {q,q — 4) -graph if no set of at most q vertices 
induces more than q — 4 distinct P4S. 



Lemma 8. A p-connected component of a {q,q — 4)-graph is either a spider or 
it contains less than q vertices. 
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We consider {q, q — 4)-graphs for fixed parameter q. A lot of classes that 
have been examined before can be expressed as (g, q — 4)-graphs. (For details 
and other examples and results see the various papers of Jamison and Olariu on 
related classes [2|.) 

The decomposition tree for (g, g — 4)-graphs can be found in linear time by an 
algorithm of Bauman . Briefly we show how to extend the result for cographs 
to (g, g — 4)-graphs. This class of graphs has again a decomposition tree, similar 
to the one for cographs with a additional node type, and with some restricted 
type of leaves which they typed (thin or thick) spiders. 

It is easy to see that the maximum clique transversal set number of a thin 
spider with clique K and stable set 5" is 1 and the hitting set number for a thick 
spider is 2. 

A decomposition tree for graphs with few P4s can be obtained in linear time 
(see, e.g., |2]). For the tree nodes that are different from the complete sum or 
union, we obtain easy algorithms. 

Theorem 8. There exist linear time algorithm to eompute the maximum clique 
transversal set number for (g, g — 4) -graphs for each q. 

7 Comparability Graphs 

In this section, we consider the maximum clique transversal problem for compa- 
rability graphs. 

For a given comparability graph, its transitive orientation can be found in 
linear time [H]. In [Sj it was shown that there exists an 0{M {n)-\-rn^/n) algorithm 
to compute the clique transversal of a comparability graph. Here M(n) is the 
time complexity needed to multiply two nxn matrices. In the following we show 
how to use transitively-oriented graphs to solve the maximum clique transversal 
set problem for comparability graphs (and hence for permutation graphs) in 
0{my/n) time. 

Given a comparability graph G = (V, E) and its transitively-oriented di- 
graph D = {VtA). According to Redei [IS] every directed path in £> = (V,A) 
corresponds to a clique of G. We add a new vertex s to D and add arcs from s 
to every source vertex uin D. Correspondingly we add another new vertex t and 
arcs from every sink vertex w to t. We label s and t with numbers 0 and |R| -I- 1, 
respectively. Let D' = (R',A') denote this digraph. Clearly D' is cognizant of 
the incidence relationships in D. 

For solving the maximum clique transversal set problem, we remove some 
vertices and arcs from D* such that every s,t-path in the new digraph is of the 
same length in D' = {V , A'). 

We use the following notation: 

— l(y) is the length of a longest path from to t in D' . 

— d~{v) and d'^{v) represent the in-degree and out-degree respectively of v. 

— iV+(w) = {w I (v,w) € A'} is the out-neighbourhood of a vertex v. 

We omit the proof of the following lemma to keep the size within limits. 
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Lemma 9. For 1 < * < + 1, let L = {s' = vq,vi, . . . ,Vi = t) be a s' , t-path. 

If L is a longest s', t-path (an oriented geodesic), then each arc {vj-i,vj) in L 
has i{vj-i) = i{vj) + 1, where 0 < j < i- 

By Lemma 0 we know that an arc (u, v) with £{u) — £{v) ^ 1 does not exist 
in any longest s, t-path in D' . We remove arcs that have the same property as 
(u, v). If this removal leads to vertices having in-degree zero, we know that these 
vertices except s cannot appear in any longest s, t-path. 

We construct a digraph G* = {V*,E*) in 0(|W| -I- |I?'|) time as follows. 
By using a depth-first search, we compute £{u) for each vertex u in D* time 
(see, e.g., El)- Next, we follow the ordering of vertices to check the relationships 
between u and N~^{u) in the following: 

1. If £{u) — £{v) > 2, then remove the arc {u, v) and decrease d+(u) and d~{v). 

2. If d~ {u) = 0 and u s, remove the vertex u and all its neighbours in D' . 

Of course we decrease d~^{u) and d~{v). 



Lemma 10. All maximal paths in G* are s,t-paths of the same length. 

Proof. By algorithm of constructing G*, G* only has arcs {u, v) with £{u)—£{v) = 
1 and G* has the same source s and sink t as D'. Hence, all maximal paths in 
G* are s,t-paths. Let L = {s,vi, . . . ,t) be an s, t-path in G* . The sequence 
£{s) , £{vi) , . . . , £{t) of L is an ordered sequence with £{vi-i) = £{vi) — 1. By 
Lemma El L is an s, t-path of maximum length. Therefore all maximal paths in 
G* are maximum s, t-paths. □ 



Lemma 11. All longest s, t-paths of D' are also longest s, t-paths in G' . 

We have to omit the proof. 

Lemma 12. The maximum clique transversal problem of a comparability graph 
G can be reduced to the minimum s,t-vertex separating problem in G* . 

We have to omit the proof. By |^, we have: 

1. It takes 0{my/n) time to compute the number of maximum vertex disjoint 
s, t-paths of G* (see, e.g., |]T^). 

2. The number of maximum vertex disjoint s, t-paths of G* is equal to the 
number of minimum s,t-vertex separators in G* by Monger’s theorem. 

We obtain the following result. 

Theorem 9. Given a comparability graph G = {V,E) with \V\ = n and \E\ = 
m, the maximum clique transversal number of G can be computed in 0{my/n) 
time. 
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8 Distance Hereditary Graphs 

Recall the definition of a distance hereditary graph: 

A graph G = (V, E) is called distance hereditary if each pair of vertices are 
equidistant in every connected induced subgraph containing them. 

The following characterization, using so-called twin sets, appeared in [2]. 

Theorem 10. Distance hereditary graphs can be defined recursively as follows: 

1. A graph consisting of one vertex is distance hereditary. The twin set is the 
vertex itself. We call this a start operation. 

2. If Gi and G 2 are distance hereditary, then the union G of them is again 
distance hereditary. The twin set is the union of the twin sets ofGi and G 2 . 
We call this the false twin operation. 

3. IfGi and G 2 are distance hereditary, then the “join” obtained by connecting 
every vertex of the twin set of Gi with the twin set of G 2 is again distance 
hereditary. The twin set is again the union of the twin sets of Gi and G 2 . 
We call this the true twin operation. 

4 . If Gi and G 2 are distance hereditary, then G obtained by connecting every 
vertex of the twin set of Gi with the twin set of G 2 is again distance hered- 
itary. The twin set is the twin set of Gi . In this case we say that G 2 is 

attached to Gi. 

For each of the operations in Theorem[^we show how to compute an optimal 
maximum clique transversal set. 

We maintain for each distance hereditary graph G = (V, E) with twin set T 
five parameters; (w, h,ujt, ht, h'). We call this set of parameters the character set 
of G. These parameters represent the following: 

LO and h are the clique number and maximum clique transversal number of G. 
LOt and ht are the clique number and maximum clique transversal number of 
G[T], i.e., the subgraph induced by the twin set T. 
h' is the cardinality of a smallest set of vertices such that this set can hit the 
maximum cliques of G[T] and G. 

Lemma 13. If G consists of one vertex, then (uj,h,ujt,ht,h') = (1,1, 1,1,1) 

Lemma 14. Assume G is obtained by a false twin operation of Gi and G 2 . 
Let T\ and T 2 be the respective twin sets of G\ and G 2 . Hence T = Ti UT 2 . Let 
{oji, hi,ujt^, ht^, h'f) be the parameter set for Gi {i = 1, 2). Since G is the (disjoint) 
union of G± and G 2 in this case, we can use the formulas for the union and join 
to obtain lo and h. Also the twin set T is the disjoint union ofTi and T 2 . Hence 
ujt and ht can be also be computed using the formulas for the union and join. 
For the parameter h' we have to consider 4 cases. 

1. = W (2 and ivi = UI 2 . Then h' = h'l -f /i^. 

2. = tvt 2 cind uji > tU 3 -i (i = 1,2). Then h' = h)-\- ht 2 _i- 

3. uJti > and wi = o ;2 (* = 1,2). Then h' = h'^-\- h^-i. 
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4 - ujti > ^t3-i o^nd u>i > u>3-i {i = 1 , 2 ). Then h' = h'^. 

5 . and u>i < u>3-i {i = 1 , 2 ). Then h' = h^. + /i3_i- 

Proof. We show that the 4 ^^^ case is true. According to the definition of h' , h' is 
the smallest set of vertices such that this set can hit maximum cliques in G\T] 
and G. Since and uji > maximum cliques in G\T] and G occur 

in G\Ti\ and Gi, respectively. Hence h' is equal to /i' in this case. 

□ 

Lemma 15. Assume G is obtained by attaching G2 to Gi. Since T = Ti; 
LOt = Wti and ht = ht^ ■ For the parameters uj and h we have to consider 5 cases. 

1. wtj + > max(wi, W2) . Then to = + tot2, and h = Tnm.{ht^,ht2). 

2 . + tjJt2 = 0J3-i > oJi. Then uj = oJs-i and h — imn(h'^_^, hf^ + hi), 

ujti + ujt2 = uji = 0J2. Then to = uji = LO2 and h = min(/i'j^ + /12, ft-2 + ^i)- 

3 . iOti + ^t2 < OJi > uj3-i. Then uj = uji and h = hi. 

4. iOti + 0Jt2 < 0J2 = 0J\. Then uj = oji = 102 and h = h\ + h2. 

For the parameter h' we consider 4 cases. 

1 . uJti + 0Jt2 > max(wi,W2). Then h' = ht^. 

2 . uJti + (jJt2 < oji > i02- Then h' = h'l. 

3 . 0 Jt\ F 0Jt2 F 0J2 F 0J\. Then h^ = h^^ -t- h2. 

4. iOti + ^t2 Foji = 0J2. Then h' = h'l + /12. 

We omit the proof of this lemma. 

Lemma 16. Assume G is obtained by a true twin operation of G\ and G2. 
Since T = TiU T2, we have u>t = ojt^ + u>t2 o,nd ht = mm(ht^, ^42)- ^^6 P^~ 

rameters uj and h we have exactly the same cases and formulas as in Lemma \T 5 \ 
For the parameter h' we have to consider 3 cases. 

1 . ujtt + ujt2 > max(o;i,a;2). Then /i' = min(/itj, 

2 . uJt3 + ujt2 < uJi = max{uji,uj3-i) (i=l, 2 ). Then h' = imn(h),hi + ht^_J. 

3 . uJt^ + ujt2 < uji = UJ2. Then h' = min(/i'j^ + /i 2 7 /12 + ^i)- 

The proof of Lemmadlis similar to the proof of LemmaHH We finally obtain 
the following result: 

Theorem 11. If G = {V,E) is a distance hereditary graph, then the maximum 
clique transversal number can be obtained in linear time. 

9 Conclusion 

Courcelle, Engelfriet and Rozenberg introduced the concept of cliquewidth in 
1993 . The family of graphs of cliquewidth at most k can be defined by k- 
expressions based on graph operations which use k vertex labels. As mentioned 
before, both cographs and distance hereditary graphs are contained in the graph 
family of cliquewidth at most 3 . In Section |S] we have solved the maximum clique 
transversal problem for distance hereditary graphs in time 0 {n + m). However 
until now it remains open to compute the maximum clique transversal problem 
for graphs with bounded clique- width at most k. For further research, it is worth 
studying. 
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Abstract. Every graph is the edge intersection graph of subtrees of 
a tree. The tree-degree of a graph is the minimum maximal degree of 
the underlying tree for which there exists a subtree intersection model. 
Computing the tree-degree is NP-complete even for planar graphs, but 
polynomial time algorithms exist for outer-planar graphs, diamond-free 
graphs and chordal graphs. The number of minimal separators of graphs 
with bounded tree-degree is polynomial. This implies that the treewidth 
of graphs with bounded tree-degree can be computed efficiently, even 
without the model given in advance. 



1 Introduction 

An intersection model of a graph can be very useful to solve many problems 
that seem to be hard when the graph is given without the model. Examples of 
these problems are the maximum clique, treewidth, minimum fill-in, clique cover- 
ing numbers, rankings, clique transversal, independent set, (vertex-)separators, 
(see, e.g., 0). It is well-known that for every graph there exists an intersection 
model, which is the intersection of subsets of a set [Hj. Unfortunately, this kind 
of model is too general to be useful to solve algorithmic problems. More appro- 
priate are geometric intersection models. Well-known classes of graphs for which 
geometric intersection models exist are planar graphs, interval graphs, permuta- 
tion graphs, cocomparability graphs, circle graphs, circular arc graphs, unit disk 
graphs, string graphs, etc. For graph classes not defined here explicitly we refer 
to the overview and the classical work [TO] . 

Given a graph, finding the intersection model is often not an easy task, 
because this implies a recognition algorithm for that class. Well-known examples 
are unit disks graphs [7], string graphs [1^ and EPT-graphs [T^. For each of 
these classes the recognition is NP-hard. 

One of the oldest examples of graphs with a nice intersection model is the 
class of chordal graphs. Chordal graphs are defined as those graphs without any 
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chordless cycle of length at least four. One characterization of chordal graph is 
the following [^: A graph is chordal if and only if it is the intersection graph 
of subtrees of a tree. In this intersection tree model of chordal graphs, we only 
require that the the intersection of two subtrees is at least one common vertex. In 
this paper we investigate intersection graphs of subtrees of trees where vertices 
are adjacent if they share at least one edge (or line) of the underlying tree. For 
clarity, we shall refer to vertices and edges of the underlying tree as points and 
lines, respectively, in the rest of the paper. 

2 Preliminaries 

Let G = (V, E) be a graph. As usual, we use n = \V\ and m = \E\. For a vertex 
X £ V we denote by G — a; the graph G[F \ {x}]. For a subset ^ C we use 
G — S' to denote the induced subgraph G\V \ S]. A clique is a complete subgraph 
of G. We say that a clique covers an edge if the edge belongs to the clique. 

Definition 1. A clique covering of G is a family of cliques such that each edge 
of G is in at least one clique of the family. The minimum cardinality of such a 
family is the clique covering number, denoted by cc(G). 



Definition 2. Given a tree T and a collection {T^ : x € V} of subtrees we 
construct a graph (V,E) such that {x,y} £ E if and only if and Ty have at 
least one line in common. We call a graph which can be represented in this way 
an edge-intersection graph of subtrees in a tree and the pair (T, {Tj, : x £V}) 
is called an edge-intersection model. 



0 (0 (0 

© © ® 




Fig. 1. The domino graph and an edge-intersection representation. 



Lemma 1. Every graph is an edge-intersection graph of subtrees of a star 
with maximum degree cc(G). 

Proof. Let G = (V,E) be any graph and let the edges of G be covered by 
k = cc(G) cliques. Let T be a star ATi^fc with central point c. We use a bijection 
from the cliques in the cover to the points of T — c. For each vertex x £ V let 
Tx be the subtree of T induced by c and the points corresponding with cliques 
containing x. Clearly, two vertices x and y are adjacent in G if and only if 
and Ty share a line of T. □ 
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Clearly, not every graph is chordal, but, as shown in Lemma[Tl every graph 
is an edge-intersection graph of some tree. In this paper we are interested in the 
minimum possible value of the maximal degree of such an underlying tree. 

Definition 3. Let G = {V, E) be a graph. The minimum value e for which there 
exists a tree T with maximal degree e such that G can be represented as the 
edg e-inters eetion of subtrees ofT is called the tree-degree of G denoted by e(G). 

Remark 1. It is easy to see that e(G) = 0 if and only li E = 0. For connected 
graphs G = {V, E) we have e(G) = 1 if and only if G is complete and E ^ 0. 
On the other hand e(G) = max{2,e(Gi) : 1 < i < k} holds for graphs G with k 
connected components Gi = (Vi, Ei) such that Ei ^ 0 for 1 < i < k, k > 2. 

If X is an isolated vertex of G then e(G) = e(G — x) because we can extend 
an arbitrary representation of G — x as edge-intersection graph to a model of G 
by choosing any single point as subtree representing x. If x G C is adjacent to 
all vertices in 14 \ {x} then e(G) = e(G — x) because we can extend every model 
of G — X by subtrees of T to a representation of G by choosing the whole tree T 
as a subtree representing x. Finally we have e(G — x) < e(G) for all x G F. 

If G' is the graph obtained from G by shrinking the edge {x, y} to a single 
vertex z then e(G') < e(G) because we can define to be the union of and 

Ty. 

The tree-degree of a graph can be the number of edges, e.g. for chordless 
cycles of length at least four or complete bipartite graphs, see Corollary |T] 

Lemma 2 . G is an interval graph if and only if e(G) < 2. 

Proof. Let (Gi, G 2 , . . . , G^) be a consecutive clique arrangement of an interval 
graph G = (V,E) [5]. We consider a path T with points Pi, 0 < i < k and lines 
{pi-i,Pi}, 1 < i < k. For each vertex x G F let be the subpath of T induced 
by the set {pi-i,pi : x G Gi}. Then G is exactly the edge-intersection graph of 
the subpath of the path T. 

Now let G be a graph with e(G) < 2. We consider a corresponding represen- 
tation of G as edge-intersection graph of subpaths of a path T. Obviously 
{x : p, 9 G V(Taf)'\ is a clique of G for each line {p, q} of T. Since we may assume 
that G does not contain isolated vertices each subpath contains at least one 
line. Hence the lines of T define a consecutive clique arrangement of G. □ 

Lemma 3. G is a chordal graph if and only if e(G) < 3. 

Proof. Chordal graphs are exactly the vertex-intersection graphs of subtrees of 
a tree jU]. For an arbitrary chordal graph G = (V,E) we extend any vertex- 
intersection model (T, {T^ : x G 14}) to an edge-intersection model as follows: 

1. To each point p of T we stick an additional pendant leaf p' and extend all 
subtrees T^, containing p to p' . 

This way we obtain an edge-intersection model for G. 
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2. Each inner point p of degree d{p) > 4 we split it into d{p) — 2 points 
Pi) ■ • • )Pd(p )-2 connected in a path-like fashion by d{p) — 3 additional lines. 
Two neighbors of p become adjacent to pi and Pd(p)- 2 > respectively, and the 
remaining neighbors are matched to p 2 , ■ ■ ■ ,Pd{p)- 3 - All subtrees containing 
p are modified in the same way. 

The latter operation preserves the modeling property while decreasing the 
maximal degree of a point to at most 3. 

Now let G be a graph that is not chordal. Hence G contains a chordless cycle 
of length fc > 4. By Corollary [T] the tree-degree of G is at least k. □ 

One of the basic results that we need is a bound on the number of points in 
the underlying tree. 

Lemma 4. For each graph there is an edge-intersection representation on an 
optimal tree with at most |m points. 

Proof. A tree is optimal for the graph G if its maximal degree is e(G) and there is 
a collection of subtrees representing G. Let us call a line high if it is incident with 
two branch points (these are points of degree at least three) and low otherwise. 
If two subtrees share a line, we say the line contains the edge connecting the 
two vertices corresponding to the two subtrees. 

We consider a low edge of an optimal tree T with minimum number of points. 
If all edges contained in this line are also contained in some other lines we can 
contract it without changing the maximal degree. Moreover the new tree still 
represents the same graph. Hence every low line of T contains an edge contained 
in no other line. 

This means that the number of low edges in T is at most m, especially the 
number of leaves is at most m. Since the number of leaves in a tree is at least 
twice the number of branch points the latter number is bounded by |m. This 
implies that the number of high lines is at most — 1. Since the total number 
of lines is bounded by |m — 1 the tree T contains at most |m points. □ 

3 Complete Separators in Edge-Intersection Graphs 

Let {T,{Tx : X G E}) be an optimal edge-intersection model of G = (V,E) such 
that the number of points in T is minimal. We consider a line i = {a, &} of T such 
that neither a nor 6 is a leaf. Removing i from T results two disjoint subtrees 
T“ and T^. Both T“ and contain at least one subtree for a suitable vertex 
X. Otherwise we can remove either T“ — a or — b from T resulting a smaller 
model. Consequently, {x : contains F] is a complete separator of G. Therefore 

every internal line of T corresponds to a complete separator. This leads to the 
following lemma: 

Lemma 5. If a graph has no complete separator, then it has an optimal edge- 
intersection tree which is a star. 
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(a, b, c, d) b, c,d'^ 

(^a,b, c,d') — (g, b,c,d, a, b, c,d ^ — («, b, c, cj^ 

©CD©® ( c® (7,b, c,d) 

Fig. 2. The graph 47^2 and a representation of its complement. 



Theorem 1. For all graphs G holds e(G) < cc(G) with equality if G does not 
eontain a eomplete separator. 

Proof. By Lemma [1] we have e(G) < cc(G). To prove equality, assume that 
G has no complete separator. By Lemma |5] there is an optimal representation 
of G as edge-intersection graph of a star iLi.fc. Clearly each line of this star 
corresponds with a clique of G, and these cliques cover all edges of G. It follows 
cc(G) <k = e(G). □ 

Corollary 1. = i ■ j and e(Gfc) = k for k > 3. 

Proof. It is easy to see that neither Kij nor Gfc contain a complete separator, 
and cc{Kij) = i ■ j and cc(Gfe) = fc for A: > 3. □ 

Lemma 6. Let S be a complete separator of G. Then M < e(G) < M +1 holds 
where M = max{e(G[G U S]) : G[G] is a connected component of G — S'}. 

Proof. Obviously the lemma is true for chordal graphs G. Moreover, M < e(G) 
holds because G[G U S] is an induced subgraph of G. 

On the other hand, let G^, . . . , G'^ be the vertex sets of the connected compo- 
nents of G — S. We consider optimal models (T®, {T} : a: G G® U S|) of G[G* U S] 
for i = 1, . . . , d. Each tree T* contains a point p* that belongs to all subtrees T}, 
s G S. We create an additional point g* adjacent to p* for each i and construct 
a tree T by linking the trees T® by a path (g^, g^, . . . , g'^). The subtrees T} for 
s G S are linked in the same way to create Tg. All other subtrees T^, x ^ S are 
defined to be T} if a: G GL 

It is easy to see that (T, {T^, : a: G Ej) is an edge-intersection representation 
of G and Z\(T) < M-k I. □ 

Remark 2. An interesting special case with respect to Lemma 0 is that each 
subtree T* contains a line P that belongs to all subtrees T}, s G S. Under this 
circumstances we can subdivide P by an additional point p®. Proceeding as in 
the above proof we obtain e(G) = M. 

However, in general such optimal models (T*, {TJ : x G G*U5'|) do not exist. 
In Figure we give a graph O' obtained by adding two simplicial vertices to 
opposite faces of the octahedron O. Since no minimum edge clique cover of O 
contains opposite triangles we have e(0) = 4 but e(O') = 5. 

Lemma 7. Let S be a complete separator of G = (V,E). Then there is an 
optimal edge-intersection model (T, {T^ \ x gV}) of G such that T has a line £ 
with S = {x : Tx contains £}. 
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Fig. 3. The octahedron graph extended by two simplicial vertices. 



Proof. Let {T,{Tx : a; S F}) be any optimal edge-intersection model of G and 
let C be the vertex set of a connected component of G — S'. Since S is complete 
there is a point p contained in all subtrees Tg, s G S. We split p into two points 
Pi and p 2 joined by the line £. A neighbor q oip becomes adjacent to pi if there is 
a vertex c € C such that Tc contains q, and q becomes adjacent to p 2 otherwise. 
All subtrees x & V are modified accordingly. Especially for c G C the new 
subtree does not contain p 2 since c has neighbors in G U S only. Similar, for 
d G F \ (G U S) the new subtree does not contain pi since d has no neighbors 
in G. Finally, for s G S the new subtree contains both pi and p 2 - The latter 
property ensures S' = {a; : contains £}. 

Obviously splitting p into pi and p 2 does not increase the maximum degree 
of the host tree. Moreover the new model (T', {T^ : x G V}) represents the same 
graph G. □ 

4 Minimal Separators in Edge-Intersection Graphs 

In this section we locate the minimal separators in an edge-intersection graph. 
A minimal (vertex) separator is minimal for two non-adjacent vertices, see |10| 
(page 82) for a precise definition. S is a minimal separator of G if and only if 
G — S has two connected components such that each vertex in S has a neighbor 
in both of them. 

Definition 4. Let G = (V, E) be a graph and let S be a separator of G. Then 
G <ZV is a full eomponent of S in G if every vertex in S has a neighbor in G . 

Remark 3. Obviously a set S' C E is a minimal a, 6-separator of G = (V, E) if 
and only if a and 6 are vertices in different full components of G — S. 

We proceed with a lemma similar to Lemma i For W C E let G I W be the 
graph obtained from G = (V,E) by shrinking W to a single vertex. 

Lemma 8. Let S be a eomplete minimal separator of G = (V,E). Then 
e(G) = max{e(G | (F \ (S U G))) : G[G] is a eonnected eomponent of G — S}. 

Proof Let G(G) be shorthand for G | (F \ (S U G)). Since S is a minimal 
separator there is a full component D G of G — S. Hence G(G) is the graph 
G[G U S] augmented by a vertex d with N{d) = S. This implies e(G) > e(G(G)) 
because we obtain G{C) from G in two steps: first we remove all components of 
G — S except G and D and then we shrink D to the single vertex d. 
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Let C^, . . . , be the vertex sets of the connected components of G — 5'. To 
show e(G) < max{e(G(C®)) : 1 < i < d} we consider optimal models (T*, {T* : 
a; € G* U S'}) of G(G®) for i = 1, . . . , d. Since S is a complete separator of G(G*) 
Lemma [T| ensures the existence of a line in T* that belongs to all subtrees Tg 
for s G S. Now we can follow the lines of Remark to complete the proof. □ 

Remark 4- The assertion of the lemma remains true if S is a complete separa- 
tor that is not minimal because every complete separator contains a complete 
minimal separator. 

Next we consider the number of minimal separators in a graph of bounded 
tree-degree. Let S be any minimal a, ^-separator, and let Gq and be the full 
components of S containing a and b, respectively. Consider the two subtrees T' 
and Tj formed by the union of subtrees corresponding to vertices of Go and Ct, 
respectively. Hence and are two subtrees of T having at most one point 
in common, otherwise some vertex of Ca would be adjacent to some vertex in 
Cl,. There is a unique shortest path starting from a point p of to a point q of 
Tl such that the path does not pass any line of nor T^. Notice that p maybe 
equals q. Let La (resp. Lt) be the set of lines of T'a (resp. T^) incident with p 
(resp. q). By Remark [3l S corresponds to the set of subtrees containing a line of 
La and a line of L(,. Therefore every subtree Tg with s G S contains a line in La 
and another line incident with p but not in La- Any subtree containing a line 
of La and another line incident with p but not in La must be in S. Otherwise 
it should be in Ga, a contradition to the definition of La- Similarly, any subtree 
containing a line of Li, and another line incident with q but not in Lj, must be 
in S. 

Consider an edge-intersection graph G = {V,E) and the intersection model 
of subtrees of a tree T with maximum degree e = e(G). Consider two subtrees 
and Th that do not share any line of T, i.e., a and h are non-adjacent vertices of 
G. Following the obsevation above the number of subsets of lines incident with 
a point of T is an upper bound of the number of minimal separators. For each 
internal point of T, the number of subsets of lines incident with it is at most 2^ . 
There are at most |m internal points. Since each separator correspondes with 
two complementary subsets of lines incident to a point the number of minimal 
separators is bounded by 3m • 2^“^. 

Theorem 2. There are at most 3m minimal separators in a graph G with 
m edges and e = e(G). 

5 Treewidth 

In this section we show that we do not need the intersection model to compute 
the treewidth of a graph with bounded tree-degree. 

Theorem 3. There exists a polynomial time algorithm to find the treewidth of 
any edge-intersection graph with bounded tree-degree. 
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Proof. First we use the algorithm of [2] to list all minimal separators in a graph. 
In the previous section we have shown that the number of minimal separators 
in a graph with bounded tree-degree is linear, hence the listing algorithm runs 
in polynomial time. 

Having the complete list of all minimal separators, we make use of the result 
of 1^ which states that treewidth can be computed in polynomial time for 
any graph with a bounded number of minimal separators. □ 

A more or less trivial upper bound for the treewidth of a graph with bounded 
tree-degree is the following. 

Lemma 9. If G = {V,E) is an edge-intersection graph, then the treewidth is 
hounded by e{G) ■ to{G), where oj{G) is the cardinality of a maximum clique of 
graph G. 

Proof. Consider an underlying tree T which models G with maximum degree 
e(G). Take the chordal embedding of G by making two vertices of H adjacent if 
they have at least a point of T in common. The maximal cliques of H correspond 
with the subtrees of T that share a point in T. Consider a point p in T. Every 
line incident with p is contained in at most to{G) subtrees. There are at most 
e(G) lines in T incident with p. □ 

6 Complexity of Computing the Tree-Degree of a Graph 

In this section we show that computing the tree-degree is NP-complete even for 
planar graphs. We first show that computing cc(G) for planar graphs G is NP- 
complete and then reduce this problem to the problem of computing tree-degrees 
for planar graphs. 

It is well known that the problem clique cover 

INSTANCE: Graph G = (V,E), positive integer k. 

QUESTION: Is cc(G) < fc? 

is NP-complete I15I19I13I for general graphs. This problem was posed by Erdos, 
Goodman and Posa [8] and somewhat later also by Lovasz m- The parameter 
cc(G) is computable in polynomial time for chordal graphs G |18| . for graphs 
with maximum degree less than 6 [m, and for line graphs pW)j . It cannot 
be approximated within a factor less than two for graphs in general m unless 
P=NP. 

In the following we first show that the problem clique cover is NP-complete 
for planar graphs. A triangulation of the plane is a planar graph where every 
face is a triangle. It is easy to see that every triangulation of the plane is 3- 
connected. If G is a triangulation of the plane without any triangle separator 
(i.e. every triangle of G is a face) then G is Ai 4 -free except G = AT 4 . Therefore 
such a triangualtion does not contain any complete separator. 

In [22] Uehara considers the problem independent triangle set defined 
as follows: 
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INSTANCE: Graph G = (V, E) and integer k. 

QUESTION: Does G contains k triangles such that no two of them have 

an edge in common? 

His proof shows that the problem independent triangle set is NP- 
complete, even if restricted to triangulation of the plane without any triangle 
separator. In a triangulation of the plane that is not K 4 and contains no triangle 
separators, every edge is contained in two triangles and we can let all cliques in 
any clique cover be triangles. Suppose G is such a triangulation of the plane. Let 
T be the set of all triangles of G. It is easy to see that G contains k triangles such 
that no two of them have an edge in common if and only if G has a clique cover 
of |T| — k triangles. Therefore the problem clique cover remains NP-complete 
even if restricted to triangulation of the plane. 

Let the problem tree-degree be defined as follows: 

INSTANCE: Graph G = (V,E) and integer k. 

QUESTION: Is e(G) < fc? 



Theorem 4. The problem tree-degree is NP-complete, even if restricted to 
triangulations of the plane. 

Proof. By the NP-completeness of clique cover and Theorem E] □ 

7 An Approach to Computing the Tree-Degrees 
for Some Classes of Graphs 

We restrict our attention to chordal graphs first. By Remark[T]and Lemmas[2]and 
|3]we can compute the tree-degree of a chordal graph in linear time because both 
interval graphs and chordal graphs can be recognized in linear time. Moreover, 
the remark and the two lemmas characterize all graphs with tree-degree at most 
three. This allows us to concentrate on graphs with tree-degree at least four in 
the following algorithms. 

Our next algorithm is based on Remark El Assume we are given a graph G 
without complee separators. By Theorem [T]we know e(G) = cc(G), so we have to 
ensure that we are able to compute cc(G). Now assume G has a complete separaor 
S. To apply RemarkElwe have to construct for each component G[G] of G — 5” a 
model of G[GUS'] such that all vertices in S are present at a common line. Both 
are possible for diamond-free graphs. (A diamond is the graph obtained from 
K 4 by removing one edge.) We say that G is c-connected if G does not contain 
a complete separator. 

Algorithm TF 

Input A diamond-free graph G = (V,E). 

Output e(G). 

1. Compute the set S of all inclusion-maximal complete separators of G. 
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2. Compute the set B of all maximal c-connected subgraphs of G. 

3. Construct a tree T with nodes 5 U and arcs G"} whenever S C V' for 
G' = {V',E'). 

4. For each G' G B compute the set C of all maximal cliques and construct an 
edge intersection representation on a star Ki \c\- 

5. For each separator ^ G 5 do 

(a) Create a path on d-riS) points. 

(b) For each G' adjacent to iF in T choose a line £ of the tree representing 
G' that is a line of Tg for all s G S. Subdivide £ by an additional point. 

(c) Link these subdivision point to the point of the path in a 1-1 manner as 
shown in Figure |H 

6. Output e(G) = max{|C| : G' G C} and stop. 




Fig. 4. Linking stars. 



The correctness of this algorithm relays on Theorem [T] Remark |2] and the 
following simple fact. 

Lemma 10. A graph G is diamond-free if and only if every edge of G belongs 
to a unique maximal clique of G. 

This lemma implies that the set of maximal cliques is the unique edge-clique 
cover of a diamond-free graph. This set can be computed in linear time. More- 
over, every complete separator is contained in a maximal clique. 

Theorem 5. The tree-degree of diamond-free graphs can be computed in time 
0{nm). 

Proof. What remains to prove is the time bound. We apply Tarjan’s algorithm 
to find all complete separators in time 0{nm). Steps to [6] can be executed 
within the same time bound. □ 



Remark 5. The same algorithms works for outerplanar graphs too. 
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Abstract. The constrained minimum vertex cover problem on bipar- 
tite graphs arises from the extensively studied fault coverage problem 
for reconfigurable arrays. In this paper, we develop a new algorithm for 
the problem, in which classical results in matching theory and recently 
developed techniques in parameterized computation are nicely combined 
and extended. The algorithm is practically efficient with running time 
bounded by 0(1.26*^ -I- kn), where k is the size of the constrained min- 
imum vertex cover in the input graph. The algorithm is a significant 
improvement over the previous algorithms for the problem. 



1 Introduction 

As the density of VLSI chips increases, the probability of introducing defects 
on the chips during the fabrication process also increases. A number of recon- 
figuration strategies have been proposed. In particular, algorithms for repair of 
reconfigurable arrays using spare rows and columns have received considerable 
attention |7| 0 | 16|17 |. A typical reconfigurable array consists of a rectangular ar- 
ray plus a set of spare rows and spare columns. A defective element is repaired 
by replacing the row or the column containing the element with a spare row or a 
spare column. In general, the number of spare rows and columns is much smaller 
compared with the total number of rows and columns in the reconfigurable array, 
and the cost of reconfigurating an array is proportional to the number of replace- 
ment rows and columns. Therefore, it is desirable to use the minimum number 
of spare rows and columns to repair the defective elements, under the constraint 
that the numbers of replacement spare rows and columns do not exceed the total 
numbers of the available spare rows and columns, respectively. 

Formally, let A be an n x m rectangular reconfigurable array. A fault covering 
of A consists of a set Sr of rows and a set Sc of columns in A such that every 
defective element in A is either in a row in Sr or in a column in Sc (or both). The 
fault covering is minimum if | S'r | -I- 1 5c | is the minimum. Suppose the array A is 
also provided with kr spare rows and kc spare columns. Then the problem is to 
either find a minimum fault covering (5c, Sc) of A with |5c| < kr and |5d < kc, 
or report no such fault covering exists. 

* This work is supported in part by NSF under Grant CCR-0000206. 



A. Brandstadt and V.B. Le (Eds.): WG 2001, LNCS 2204, pp. 55 4651 2001. 
(c) Springer- Verlag Berlin Heidelberg 2001 



56 



Jianer Chen and lyad A. Kanj 



This problem can be easily converted to the constrained minimum vertex 
cover problem on bipartite graphs, (shortly MiN-CVCB), as follows. Let Ga = 
([/ U L, E) be a bipartite graph with U = {mi, . . . , m„} and L = {z)i, . . . , Vm}, 
such that there is an edge between Ui and Vj if and only if the {i,j) element in 
the array A is defective. It is fairly easy to see that a vertex set K of k\ vertices 
in U and k 2 vertices in y is a vertex cover of the graph Ga if and only if the 
corresponding ki rows and k 2 columns make a fault covering of the array A. 
Therefore, the problem can be formally stated as follows: 

Min-CVCB 

Given a bipartite graph G = (JJ U L,E) and two integers and ki, 
either find a minimum vertex cover of G with at most ku vertices in U 
and at most ki vertices in L, or report no such a vertex cover exists. 

An extensive list of algorithms has been published in the literature for the 
Min-CVCB problem and its variations (e.g., iriDU bur! ). Most of these algo- 
rithms are heuristic and have no guaranteed performance. An algorithm pro- 
posed in [Zj introduced the concept of critical set and used branch-and-bound 
technique based on the A* algorithm in Artificial Intelligence m- No explicit 
analysis was given in [Zj for this algorithm, but it is not hard to see that in 
the worst case the algorithm running time is at least 6)(2'^“+^‘ -|- m-^/n). Exper- 
imental results show that this algorithm is much more favourable than previous 
algorithms and the running time is moderate for practical instances for the Min- 
CVCB problem. 

In the current paper, we develop simpler and more efficient algorithms for 
the Min-CVCB problem. Classical results in matching theory and recently de- 
veloped techniques in parameterized computation are nicely combined and ex- 
tended. In particular, we observe that the classical Gallai-Edmonds Structure 
Theorem [8] applied to bipartite graphs will allow us to concentrate on the 
Min-CVCB problem on bipartite graphs with perfect matching. In fact, Gallai- 
Edmonds Structure Theorem is a stronger (and more systematical) version of 
the critical set theory developed in [7] . Gallai-Edmonds Structure Theorem also 
provides a solid basis so that the recently developed technique in parameter- 
ized computation, reduction to problem kernel, can be nicely applied. We then 
use the classical Dulmage-Mendelsohn Decomposition |8] to decompose a bi- 
partite graph with perfect matching into elementary bipartite subgraphs. This 
decomposition makes the other technique, bounded search tree, in parameterized 
computation become much more effective. Combining all these enables us to de- 
rive an 0(1.26^ -I- kn) time algorithm for the Min-CVCB problem, where k is 
the size of a minimum vertex cover in the input graph. To see how significant 
this improvement is over the previous best algorithm |Z], take a typical recon- 
figurable array of size 1024 x 1024 with 20 spare rows and 20 spare columns. A 
simple estimation shows that our algorithm is more than 10® times faster! 

We point out that our results are also of theoretical interests in the current 
study of “efficient” exponential time algorithms for NP-hard problems mm. 
in particular they make a nice contribution to the study of parameterized algo- 
rithms for the minimum vertex cover problem, which has drawn much attention 
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recently |1I3I12| . A recently developed algorithm by Fernau and Niedermeier [6] 
has running time 0(1.40^ + kn) and constructs a constraint (not necessarily 
minimum) vertex cover for bipartite graphs. This algorithm could also be used 
to solve the Min-CVCB problem, but our algorithm is significantly faster and 
much simpler using elegant results in graph theory (instead of lengthy case-by- 
case combinatorial enumerations). 

2 Reduction to Kernel by GE-Theorem 

A vertex set iC in a graph G is a vertex cover if every edge in G has at least one 
end in K . A graph G = (F, E) is bipartite if its vertex set V can be bipartitioned 
into two sets U (the “upper part”) and L (the “lower part”), such that every 
edge in G has one end in U and the other end in L. A bipartite graph is written 
as G = {UU L, E) to indicate the vertex bipartition. The vertex sets U and L are 
called the U-part and the L-part of the graph G. A vertex is a U -vertex (resp. 
an L-vertex) if it is in the [/-part (resp. the L-part) of G. A matching M in a 
graph G is a set of edges in G such that no two edges in M share a common end. 
A vertex v is matched if it is an end of an edge in M and unmatched otherwise. 
The matching M is perfect if all vertices in G are matched. It is well-known that 
for a bipartite graph, the number of edges in a maximum matching is equal to 
the number of vertices in a minimum vertex cover |S] . Given a matching M in a 
graph G, an alternating path (with respect to M) is a simple path {ui, U 2 , . . . , Vr} 
such that vi is unmatched and the edges [v 2 k,V 2 k+i] are in M for all k. An 
alternating path is an augmenting path if it has odd length and both endpoints 
are unmatched. A matching M in a graph G is maximum, if and only if there 
is no augmenting path in G with respect to M |S]. A maximum matching of a 
graph with n vertices and m edges can be constructed in time Oimy/n) [10] . and 
for a bipartite graph G, a minimum vertex cover of G can be constructed from 
a maximum matching in G in linear time jS]. 

We start with a theorem, which answers an open problem posed in j^. 

Theorem 1. The Min-CVCB problem is NP-complete. 

We now show the classical Gallai-Edmonds Structure Theorem (shortly, GE- 
Theorem) 1^ can be applied to significantly reduce the size of the Min-CVCB 
problem. Let G = (V,E) be a bipartite graph. For a subset S of vertices in G, 
let G{S) be the subgraph induced by S. Let D be the set of vertices in G that 
are unmatched in at least one maximum matching, let A be the set of vertices 
mV — D that are adjacent to a vertex in D, and let G = V — D — A. 

Proposition 1. (GE-Theorem) Let G be a bipartite graph with the sets D, A, 
and G defined above. Then (1) D is an independent set in which no vertex is 
contained in any minimum vertex cover for G; (2) A is the intersection of all 
minimum vertex covers for G; (3) the subgraph G(G) has a perfect mathing. 

The sets D, A, and G can be constructed via a maximum matching M in 
the bipartite graph G, as follows. Let W be the set of unmatched vertices under 
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M. Define W to be the set of vertices such that a vertex u is in fD if and only 
if V is reachable from a vertex in W via an alternating path of even length. By 
the definition, W C W. Since exchanging the edges in M and E — M along 
any alternating path from a vertex v' in bb to a vertex v in W will result in 
a maximum matching that does not contain v, we have W Q D. Now for any 
vertex w not in W, we know is an edge in M for some vertex w'. Since 

w is not in W, w' is not connected from any vertex in W by an alternating 
path of odd length. Therefore, after removing w from the graph G, there is no 
augmenting path in G — {■!/;} with respect to the matching M — [ic, w']. Thus, a 
maximum matching in the graph G — {w} contains \M\ — 1 edges. This shows 
that the vertex w must be in every maximum matching in the graph G so w is 
not in D. In consequence, the set W is exactly the set D in Proposition [H With 
the set D known, the sets A and G can be constructed easily. 

Theorem 2. The time for solving an instance (G; ku, kf) o/MiN-CVCB, where 
G is a bipartite graph of n vertices and m edges, is 0{my/n + t{ku + ki)), where 
t{ku + ki) is the time for solving an instance {G'; k'l) for Min-CVCB, where 
k[, < ku, k[ < ki, and G' has a perfect matching. 

Proof. Let G = {UAL, E), and the instance (G; ku, k{) asks for a minimum vertex 
cover for G with at most ku C/-vertices and at most ki L-vertices. We apply the 
algorithm described above to construct the sets D, A, and G in GE-Theorem. 
The algorithm has its time complexity dominated by that of constructing a 
maximum matching in G, which is bounded by 0{my/n). Now let G' = G(G). 
Since the set A is contained in every minimum vertex cover for G and no vertex 
in the set D is in any minimum vertex cover, if the set A contains G-vertices 
and hi L-vertices, then the graph G has a minimum vertex cover with at most ku 
U -vertices and at most ki L-vertices if and only if the graph G' has a minimum 
vertex cover with at most k'u = ku — hu U -vertices and at most k[ = ki — hi 
L-vertices. By GE-Theorem, the graph G' has a perfect matching. □ 

The technique of reduction to problem kernel has been widely used in the 
study of parameterized computation and in the research in fault coverage of 
reconfigurable arrays. Theorem provides a much stronger version for this tech- 
nique than previously proposed versions in both areas. 

Hasan and Liu proposed the concept of critical sets. The critical set of a 
bipartite graph G is the intersection of all minimum vertex covers for G, which 
is, equivalently, the set A in GE-Theorem. However, Hasan and Liu were un- 
able to characterize the structure of the remaining graph when the critical set is 
removed from the graph G. By lack of this observation, they proposed to inter- 
lace the process of branch-and-bound searching and the process of constructing 
the critical set |Z]. On the other hand. Theorem El indicates explicitly that the 
remaining graph G' after removing the critical set A and the independent set 
D has a perfect matching. This fact immediately limits the size of the graph 
G' . Moreover, since every minimum vertex cover for the graph G' contains ex- 
actly one end from every edge in a perfect matching in G' , a careful study can 
show that the construction of critical sets during the branch-and-bound process 
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proposed in 0 is totally unnecessary. This observation can significantly speed 
up the searching process. Finally, having a perfect matching in the graph G' 
will enable us to derive further powerful structure techniques in the searching 
process, which will be illustrated in detail in the next section. 

The reduction to problem kernel technique has been very powerful in design- 
ing efficient parameterized algorithms I1I3I6I12I . Fernau and Niedermeier have 
considered a generalized version of the MiN-CVCB problem (in which the min- 
imization of the vertex cover is not required), and described how to reduce the 
graph size to 2kuh. Theorem enables us to improve this bound to 2(/c„ + ki): 
The facts that the graph G' has a perfect matching and that a minimum vertex 
cover of G' has at most k'^ [/-vertices and at most k[ L- vertices necessarily bound 
the number of vertices in G' by 2{k'^ + k[) < 2{ku + ki). In the development of 
parameterized algorithms for minimum vertex cover for general graphs, Chen, 
Kanj, and Jia [3j proposed to apply the classical Nemhauser- Trotter Theorem 
m that can reduce the problem size to 2k, where k is the size of a minimum 
vertex cover. Similar to GE-Theorem, Nemhauser-Trotter Theorem decomposes 
the vertices of a graph into three parts D' , A', and G', where D' is an indepen- 
dent set. A' is a subset of some minimum vertex cover, and the induced subgraph 
G(G') has its minimum vertex cover of size at least half of G'. When applied 
to bipartite graphs, GE-Theorem, which decomposes the graph into three parts 
D, A, and G, is much stronger than Nemhauser-Trotter Theorem in all aspects: 
the set A is contained in every minimum vertex cover for G while the set A! is 
only contained in some vertex cover for G. In particular, for the application of 
the Min-CVCB problem, the set A' cannot be automatically included in the 
minimum vertex cover being searched. Moreover, since the subgraph G(G) has 
a perfect matching, it follows directly that the minimum vertex cover for G(G) 
contains at least half of the vertices in G. By lack of having a perfect matching, 
the subgraph G(G') also cannot take the advantages we will use in the searching 
process, which will be described in the next section. 

3 Efficient Search by DM-Decomposition 

According to Theorem|2] we can concentrate on instances (G; ki) of the MiN- 
GVGB problem, where G is a bipartite graph that has a perfect matching and 
has at most 2{ku + h) vertices. 

A connected bipartite graph B = [U U L, E) is elementary if every edge of B 
is contained in a perfect matching in B. An elementary bipartite graph B has 
exactly two minimum vertex covers, namely U and L |^. This property makes 
our searching process very efficient on the elementary bipartite graph B: we 
should exclusively include either U or L in the minimum vertex cover. 

The classical Dulmage-Mendelsohn Decomposition Theorem (shortly DM- 
Theorem) |H] provides a nice structure for bipartite graphs with perfect matching 
so that the above suggested searching process can be applied effectively. 

Proposition 2. (DM-Theorem) . A bipartite graph G = {U U L , E) with perfect 
matching can be decomposed into elementary bipartite subgraphs Bi = (Ui U 
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Li,Ei), i = 1, . . . , r, such that every edge in G between two subgraphs Bi and Bj 
with i < j must have its U -vertex in Bi and its L-vertex in Bj. 

The subgraphs Bi will be called bloeks. A block Bi is a d-bloekii \ Ui \ = \Li\ = 
d. Edges connecting two different blocks will be called “inter-block edges” . 

Lemma 1. Let G = {UU L, E) be a bipartite graph with perfect matching. Then 
the decomposition ofG given in Proposition\^can be constructed in time 0(|Ep). 

Proof. An edge e in G is an inter-block edge if and only if e is not contained in 
any perfect matching in G [8]. Consider the algorithm given in Figure [3 



DM-Decomposition 

Input: a bipartite graph G = {U U L, E) with perfect matching 

Output: the blocks Bi, . . ., Br in DM-Theorem 

1. construct a perfect matching M in G; 

2. for each edge e = [u,v] in G do 

if the graph G~ = G — {u, u} has no perfect matching 
then mark e as an “inter-block edge” ; 

3. remove all inter-block edges; 

4. sort the remaining connected components: Bi, . . ., Br, so that every 
inter-block edge connects a U -vertex in a lower indexed block to an 
L-vertex in a higher indexed block. 

Fig. 1. Dulmage-Mendelsohn Decomposition 

Note that an edge e = [u, v\ is in a perfect matching in the graph G if and 
only if the graph G — {u, n} has a perfect matching. Therefore, Step 2 of the 
algorithm DM-Decomposition correctly determines the inter-block edges. By 
DM-Theorem, each connected component after Step 3 is a block for the graph 
G. Now applying the topological sorting in a straightforward way will order the 
blocks so that each inter-block edge in the graph G connects a [/-vertex in a 
lower indexed block to an L-vertex in a higher indexed block. 

Constructing the perfect matching M takes time 0{\E\^\E\) |T0]. Consider 
an edge e = [u, n] in G. If e is in M then obviously e is not an inter-block edge. If 
e is not in M, then after removing the vertices u and v and their incident edges, 
the matching M becomes a matching M~ of |M| — 2 edges in the remaining 
graph G~ . Thus, the remaining graph G~ has a perfect matching if and only 
if there is an augmenting path in G~ with respect to the matching M~ . Since 
testing the existence of an augmenting path can be done in linear time [^, we 
conclude that we can decide whether an edge in G is an inter-block edge in time 
0{\E\). Therefore, Step 2 of the algorithm DM-Decomposition takes time 
0(|Lp), which dominates the complexity of the algorithm. □ 

Now we are ready to describe the main body of our algorithm for the MiN- 
CVCB problem. Let {G;ku,ki) be an instance of the MiN-CVCB problem. 
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where G = {U U L,E) is a bipartite graph with perfect matching, and G has 
at most 2{ku + k[) vertices. We first apply DM-Theorem that decomposes the 
graph G into blocks Bi = {UiULi, Ei), such that every inter-block edge connects 
a [/-vertex in a block of lower index to an L-vertex in a block of higher index. 
It is easy to see that a minimum vertex cover of the graph G must be the union 
of minimum vertex covers of the blocks. Since the only two minimum vertex 
covers of the block are Ui and the constrained minimum vertex cover of 
the graph G with at most [/-vertices and at most ki L- vertices must be the 
union of properly selected [/-parts and L-parts from the blocks. 

The execution of our algorithm is depicted by a search tree whose leaves 
correspond to the potential constrained minimum vertex covers K of the graph 
G with at most ku [/-vertices and at most ki L-vertices. Each internal node of 
the search tree corresponds to a branch in the searching process. Let F(/c„ -|- k[) 
be the number of leaves in the search tree for finding a minimum vertex cover 
of at most ku [/-vertices and at most fc; L-vertices in the bipartite graph G. We 
list the possible situations in which we branch in our search process. 

Case 1. A block Bi is a d-block with d > 3. 

Let Bi = (Ui U Li^ Ei). Since the constrained minimum vertex cover K of the 
graph G either contains the entire Ui and is disjoint from Li, or contains the 
entire Li and is disjoint from Ui, we branch in this case by either including the 
entire Ui in K (and removing Li from the graph) or including the entire Li in K 
(and removing Ui from the graph). In each case, we add at least 3 vertices in the 
constrained minimum vertex cover K. Thus, this branch satisfies the recurrence 
relation: 

F(ku + ki) < 2F(ku + fc, - 3) (1) 

A block sequence [i?( , . . . , B),] is a block chain if there is an edge from the 
[/-part of B( to the L-part of B(j^^ for all 1 < i — 1. Observe that the chain 
implication can speed up the searching process significantly. For example, if we 
include the L-part of the block B[ in the minimum vertex cover K, then the 
[/-part of B[ must be excluded. Since there is an edge in G from the [/-part 
of B[ to the L-part of B' 2 , we must also include the entire L-part of B '2 in K, 
which, in consequence, will imply that the L-part of Lg must also be included, 
and so on. Thus, the L-part of B[ in the minimum vertex cover K implies that 
the L-parts of all blocks in the chain must be in K. Similarly, the [/-part of 
B'f, in K implies that the [/-parts of all blocks in the chain must be in K. This 
observation enables us to handle many cases very efficiently. 

Case 2. A 2-block is an interior block in a block chain. 

Let [B' , B, B”] be a block chain, where B is a 2-block. Then including the 
L-part of B in the minimum vertex cover K forces at least 3 vertices in K 
(the L-parts of B and B”), and excluding the L-part of B also forces at least 3 
vertices in K (the [/-parts of B and B'). Thus, in this case, the branch satisfies 
the recurrence relation m- 

Case 3. The U -parts of three different 2-blocks are connected to the L-part 
of the same 1-block, or the [/-part of a 1-block is connected to the L-parts of 
three different 2-blocks. 
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Suppose that the [/-parts of the 2-blocks B' 2 , and B'^ are connected to 
the L-part of the 1-block B'^. Then including the [/-part of B'q in the minimum 
vertex cover K forces at least 7 vertices (the [/-parts of all the four blocks) in 
K, and excluding the [/-part of Bq forces at least 1 vertex (the L-part of Bq) in 
K. Therefore, the branch satisfies the recurrence relation 



F{ku ki) < F(ku ki — 7) F{ku ki — 1) (2) 



Similar analysis shows that <m is also applicable to the other subcase. 

Case 4. All Cases 1-3 are excluded. 

Because Case 1 is excluded, there are only 1-blocks and 2-blocks. Since Case 
2 is excluded, either the [/-part or the L-part of each 2-block is not incident on 
any inter-block edges. Therefore, we can re-order the blocks as: 



r>f r?/ TDf TDf JDf TDf 

-^15 • • • 5 -^rC5 -^1C+15 • • • 5 • 5 -^2 



( 3 ) 



where B[, . . ., B'^ are 2-blocks whose L-parts are not incident on inter-block 
edges, ■■■, By are 1-blocks, and ■ ■ ■, B'^ are 2-blocks whose [/-parts 

are not incident on inter-block edges, such that every inter-block edge goes from 
the [/-part of a lower indexed block to the L-part of a higher indexed block. 
Note that for any index i, 1 < i < z, the [/-parts of B[, . . B[ plus the L-parts 
of . . ., L' make a minimum vertex cover for the bipartite graph G. Now to 

construct a constrained minimum vertex cover K of the bipartite graph G with 
at most ku [/-vertices and at most ki L- vertices, we have the following situations. 

If ku + ki > k, where k is the size of a minimum vertex cover of the graph 
G, then we can pick an index i such that the [/-parts of the blocks B[, . . ., L' 
consist of either k^ — l or vertices (such an index always exists since all blocks 
are 1-blocks and 2-blocks). Now the [/-parts of the blocks B[, . . ., L' together 
with the L-parts of the blocks B'^ make a minimum vertex cover with 

at most ku [/-vertices and at most k — (ku — 1) < fc/ L-vertices. 

Thus, we can assume that ku + ki = k. 

If 2x < ku < 2x + (y — x), then let i = — x. The minimum vertex cover K 

consisting of the [/-parts of B[, . . ., L' and the L-parts of B(^^, . . ., contain 
(z — x) -I- 2x = ku [/-vertices and k — ku = h L-vertices. Thus, AT is a desired 
minimum vertex cover satisfying the constraint. 

If ku < 2x and ku is even, then the [/-parts of the first fc«/2 2-blocks plus 
the L-parts of the rest of the blocks make a desired minimum vertex cover. 

Now consider the case ku < 2x and ku is odd. If the graph G has no 1- 
blocks, then obviously there is no minimum vertex cover with ku [/-vertices and 
ki L-vertices. Thus we assume that the graph G has at least one 1-block. 

It is easy to see that there is a minimum vertex cover of exactly 1 [/-vertices 
if and only if there is a 1-block whose L- vertex has degree 1, and there is a 
minimum vertex cover of exactly 3 [/-vertices if and only if there is a 1-block 
whose L-vertex is adjacent to the [/-part of at most one 2-block. Thus, we can 
further assume > 5. 

Pick the 1-block B of the minimum index i in sequence ®. Since Case 3 is 
excluded, there are at most two 2-blocks B' and B” with their [/-parts adjacent 
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to the L- vertex of B. Note that since Case 2 is excluded, no inter-block edges can 
be incident on the L-parts of the blocks B' and B" . Now re-order the sequence 
m by removing B, B' , and B" from the sequence then re-inserting B',B”,B 
in the front of the sequence. Since the L-parts of B' and B” are not adjacent 
to inter-block edges and inter-block edges from the L-part of B only go to the 
[/-parts of B' and B” , the new sequence still has the property that every inter- 
block edge links the [/-part of a lower indexed block to the L-part of a higher 
indexed block. Now the [/-parts of the blocks B' , B” , and B plus the [/-parts 
of the next {ku — 5)/2 2-blocks in the new sequence and the L-parts of the rest 
of the blocks give a minimum vertex cover for G with ku [/-vertices and ki L- 
vertices. In case the L-vertex of B is adjacent to the [/-part of fewer than two 
2-blocks, the desired minimum vertex cover K can be constructed similarly. 

Finally, the case ku > 2x + {y — x) can be reduced to the previous case by 
reversing the sequence ©, exchanging the [/-part and L-part of the bipartite 
graph G, and exchanging the numbers ku and k[. 

Thus, in Case 4 the Min-CVCB problem can be solved in linear time. 

4 Putting All Together 

We summarize all the discussions in the algorithm given in Figure |2] 



CVCB-Solver 

Input: a bipartite graph G = {U U L, E) and two integers and ki 
Output: a minimum vertex cover K of G with at most [/-vertices and 
at most ki L-vertices, or report no such a vertex cover exists 

1. for each [/-vertex u of degree > ki do 

include u in K and remove u from G; ku = ku — 1; 

2. for each L-vertex v of degree > ku do 

include v in K and remove v from G; ki = ki — 1; 

3. apply GE-Theorem to reduce the instance so that G is a bipartite 
graph with perfect matching and has at most 2(fe„ + ki) vertices; 

4. apply DM-Decomposition to decompose G into blocks Bi, . . Br\ 

5. while one of the Cases 1-3 in section 3 is applicable, branch accordingly; 

6. solve the instance for Case 4 in section 3. 



Fig. 2. The main algorithm 

Steps 1-2, taking time 0{kn), make immediate decisions on high degree ver- 
tices. If a [/-vertex u of degree larger than ki is not in K, then all neighbors of u 
should be in K, which would exceed the bound ki. Thus, such [/-vertices should 
be automatically included. Similar justification applies to L-vertices of degree 
larger than fc„. After Steps 1-2, the degree of the vertices in the graph is bounded 
by k' = max{fc„, ki}. Since each vertex can cover at most k' edges, the number 
of edges in the resulting graph must be bounded hy k'{ku + ki) < (fc„ -h 
otherwise the graph cannot have a desired minimum vertex cover. 
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Theorem [2] allows us to apply GE-Theorem in Step 3 to further reduce the 
bipartite graph G so that G has a perfect matching. The running time of this 
step is bounded by 0{{ku + kiY). The number of vertices in the graph G now is 
bounded by 2{ku + h). 

Step 4 applies DM-Theorem to decompose the graph G into elementary bi- 
partite subgraphs. By Lemma[I] this takes time 0{{ku + ki)^). 

As we have discussed in Section 3, if any of Cases 1-3 is applicable, we 
can branch in our searching process with the recurrence relations 0 and 0. 
Eventually, we reach Case 4 in Section 3, which can be solved in linear time. 

It is easy to verify that -|- ki) = 1.26^“+^' satisfies the recurrence rela- 
tions o and 0. Thus, the running time of Step 5 in our algorithm is bounded 
by 0(1.26^“+^' (fc„ -I- where the factor -|- ki)'^ is the size of the graph 

resulted from Step 4. Using the general technique given in m, we can further 
reduce the time complexity of Step 5 to 0(1.26^““*'^‘). Combining all these, we 
conclude with the following theorem. 

Theorem 3. The Min-CVCB problem is solvable in time 0(1. 26^" -I- kn). 



5 Concluding Remarks 

Parameterized computation theory has proved very useful in designing practical 
algorithms in industrial applications, in particular for many NP-hard optimiza- 
tion problems. In this paper, we have developed a simple and significantly im- 
proved algorithm for the MiN-CVCB problem, which arises from the extensively 
studied fault coverage problem for reconfigurable arrays in the area of VLSI man- 
ufacturing. Our algorithm is conceptually simple, easy to implement, and nicely 
combines classical graph structural results with the recently developed tech- 
niques in parameterized computation. For typical applications, our algorithm is 
faster than the previous known algorithms by several orders of magnitude. 

Our improvement is on the reduction of the base in the exponential running 
time, which is of great theoretical and practical importance in the study of 
NP-hard optimization problems. For example, reducing the base by 0.01 in the 
parameterized vertex cover algorithm has resulted in upto 60% improvement 
in the running time for the application program used in biochemistry [^. Our 
algorithm reduces the base by more than 0.14 (from 1.4^ to 1.26*^) from the 
previous known algorithms for the Min-CVCB problem. 

We point out that our investigation emphasizes on the practicality of the al- 
gorithms, which includes simplicity (both conceptual and algorithmic) and effec- 
tiveness. The running time of our algorithm could be further improved slightly if 
we examine in more depth the combinatorial structures in a case-by-case exhaus- 
tive manner, which is something that we tried to avoid since it would deprive our 
algorithm from its simplicity and practicality. We might also apply the dynamic 
programming techniques, as illustrated in [3], which will improve the algorithm 
running time to 0(1.247^““*'^' -I- kn). However, this improvement is at the cost 
of exponential space. 
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Abstract. In this work we introduce, characterize, and provide algo- 
rithmic results for (fc, -|-)-distance-hereditary graphs. These graphs can 
be used to model interconnection networks with desirable connectivity 
properties; a network modeled as a {k, -|-)-distance-hereditary graph can 
be characterized as follows: if some nodes have failed, as long as two 
nodes remain connected, the distance between these nodes in the faulty 
graph is bounded by k plus the distance in the non-faulty graph. The class 
of all these graphs is denoted by DH(fc, -|-) By varying the parameter k, 
classes DH(fc, -I-) form a hierarchy that represents a parametric exten- 
sion of the well-known class of distance-hereditary graphs, and include 
all graphs. 



1 Introduction 

A fundamental problem in any parallel or distributed system is the efficient com- 
munication of data between processors. Such efficiency depends on the routing 
scheme defined over the system, that is the set of paths for each possible pair 
of processors. The efficiency of a routing scheme is meanly measured in terms 
of its stretch factor and dilation. The stretch factor (dilation) is the maximum 
ratio (difference) between the length of a path defined by the scheme and the 
shortest path between the same pair of processors. 

In this work we are interested in networks in which routing schemes coincide 
with shortest paths and node failures may occur. Distances are always computed 
by means of shortest paths in the subnetwork that is induced by the non-faulty 
components. In this context, the decrease of the efficiency of the communication 
depends only on the topology of the networks. 

To measure this efficiency degradation, some parameters about the topology 
can be defined. In |2] the authors defined the notion of stretch number, while in 
this paper we introduce the dilation number d{G) of a graph G. It is defined as 
the smallest k such that G G DH(A:,-|-), where a network modeled as a graph 
belonging to the class DH(fc,-|-) can be characterized as follows: if some nodes 
have failed, as long as two nodes remain connected, the distance between these 

* Work partially supported by the Italian MURST Project “Teoria dei Grafi ed Ap- 
plicazioni” . 
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nodes in the faulty graph is at most k plus the distance in the non-faulty graph. 
Elements of DH(fc, +) are called (/c, +)-distance-hereditary graphs. By varying 
the parameter k, classes DH(fc, +): (i) form a hierarchy that represents a para- 
metric extension of the well-known class of distance-hereditary graphs [15], (ii) 
include all the graphs. 

Given the relevance of {k, -|-)-distance-hereditary graphs in the area of com- 
munication networks, our purpose is to provide characterization and algorithmic 
results about the introduced graphs. 

Related works. In literature there are several papers devoted to fault-tolerant 
network design, mainly starting from a given desired topology and introducing 
fault-tolerance to it (e.g., see mm)- 

Papers m present several results about (fc, *)-distance-hereditary graphs, 
that is graphs whose induced distance is bounded by a multiplicative factor k. 
In |12| . a study about similar concepts is performed: they give characterizations 
for graphs in which no delay occurs in the case that a single node fails. These 
graphs are called self -repairing. In [8|, authors introduce and characterize new 
classes of graphs that guarantee constant stretch factors k even when a multiple 
number of edges have failed. In a first step, they do not limit the number of edge 
faults at all, allowing for unlimited edge faults. Secondly, they examine the more 
realistic case where the number of edge faults is bounded by a value £. The corre- 
sponding graphs are called fc-self-spanners and (/c, £)-self-spanners, respectively. 
In both cases, the names are motivated by strong relationships to the concept 
of k-spanners PH. Related works are also those concerning distance-hereditary 
graphs. Distance-hereditary graphs have been investigated to design intercon- 
nection network topologies mmn, and several papers have been devoted to 
them (see P] and references therein). 

Results. First, we formally introduce (fc, -|-)-distance-hereditary graphs and pro- 
vide some preliminary results. An initial characterization is given in terms of the 
dilation number. Then, we remark differences about structural properties be- 
tween (fc, *)-distance-hereditary graphs and (fc, -|-)-distance-hereditary graphs. 
Starting from these observations, we introduce the notion of “twin graph” G* 
of an arbitrary graph G. This graph has the remarkable property that G G 
DH(/c,-|-) if and only if G* G DH(fc,-|-). Thanks to this notion, we are able to 
provide a characterization of graphs G in DH(fc,-|-) based on cycle-chord condi- 
tions of its twin graph G*. 

Since we show that the recognition problem for the new graph classes is Co- 
NP-Complete (for k not fixed), then we investigate in more detail the smallest 
class among the new ones, i.e., class DH(1,-|-). In this context, our main result 
consists of listing all the forbidden induced subgraphs of every G G DH(1, -I-). A 
theoretical consequence of this characterization is that it allows us to show that 
the recognition problem of class DH(1,-|-) can be solved in polynomial time. 

This extended abstract is organized as follows. Notation and basic concepts 
used in this work are given in Section |5] while Section |3] formally introduces 
(fc, -|-)-distance-hereditary graphs and provide some preliminary results. Sec- 
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tion Estates the Co-NP-Completeness result. Section |5] characterizes graphs in 
DH(1, +) and provide a polynomial time recognition for them. Finally, Section 
lists some open problems. 

2 Notation 

In this work we consider finite, simple, loopless, undirected and unweighted 
graphs G = (V, E) with node set V and edge set E. We use standard termi- 
nology, some of which are briefly reviewed here. 

A subgraph of C? is a graph having all its nodes and edges in G. Given a 
subset S of V, the induced subgraph (S) of G is the maximal subgraph of G with 
node set S. |G| denotes the cardinality of V. If x is a node of G, by Nc{x) we 
denote the neighbors of x in G, that is, the set of nodes in G that are adjacent 
to X, and by Ng[x] we denote the closed neighborhood of x, that is Na{x) U {x}. 
G — S is the subgraph of G induced by F \ 5*. 

A sequence of pairwise distinct nodes (xq, xi, . . . , x„) is a path in G if (xi, x^+i) 
€ E for 0 < i < n, and is an induced path if ({xq, . . . , x„}) has n edges. A graph 
G is connected if for each pair of nodes x and y of G there is a path from x to y 
in G. 

A cycle Gn in G is a path (xq, . . . , x„_i) where also (xo,x„_i) € E. Two 
nodes Xi and Xj are consecutive in G„ if j = {i+ 1) mod n or i = {j + 1) mod n. 
A chord of a cycle is an edge joining two non-consecutive nodes in the cycle. 
Hn denotes an hole, i.e., a cycle with n nodes and without chords. The chord 
distance of a cycle G„ is denoted by cd{Gn), and it is defined as the minimum 
number of consecutive nodes in G„ such that every chord of G„ is incident to 
some of such nodes. We assume cd(iJ„) = 0. 

The length of a shortest path between two nodes x and y in a graph G 
is called distance and is denoted by dG{x,y). Moreover, the length of a longest 
induced path between them is denoted by Dg{x, y). We use the symbols Pa{x, y) 
and pg{x, y) to denote a longest and a shortest induced path between x and y, 
respectively. Sometimes, when no ambiguity occurs, we use Pg{x, y) and pG{x,y) 
to denote the sets of nodes belonging to the corresponding paths. 

If X and y are two nodes of G such that dG(x, y) > 2, then {x, y} is a cycle-pair 
if there exist a path pg{x, y) and a path Pg{x, y) such that pg(x, y) nPG(x, y) = 
{x,y}. In other words, if {x,y} is a cycle-pair, then the set PG{x,y) U PG{x,y) 
induces a cycle in G. 

3 Preliminary Results 

In this section we first formally define (/c, -|-)-distance-hereditary graphs and 
then provide some preliminary results. 

Definition 1. Let k be a real number. A graph G = (V,E) is a (fc, -|-)-distance- 
hereditary graph if for each connected induced subgraph G' of G: 

dG'{x,y) < dG{x,y) k, for each x,y G G^ 
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The class of all the {k^+)~ distance-hereditary graphs is denoted by DH(fc, +). 

Notice that the above definition holds for both connected and disconnected 
graphs. 

Definition 2. Let G be a graph, and {x,y} be a pair of connected nodes in G. 
Then: 

1. the dilation number dc{x,y) of the pair {x,y} is given by dc{x,y) = DG{x,y) 
-dG{x,y); 

2. the dilation number d{G) of G is the maximum dilation number over all 
possible pairs of connected nodes, that is, d{G) = max^^. dG{x,y); 

3. T>(G) is the set of all the pairs of nodes inducing the dilation number of G, 
that is, V{G) = {{x,y} \ dG{x,y) = 3(G)}. 

The dilation number can be used to provide a first characterization of graphs in 
DH(fc,+). 

Theorem 1. Let G be a graph. G G DH(fc, +) if and only if d{G) < k. 

Proof. To prove this theorem we show that 3(G) = min{t : G G DH(t, +)}. 

By Definition E] 3(G) = maxf L> g(x, y) — dG{x,y), and then 3(G) > 
DG{x,y) — dG{x,y) for each pair of connected nodes x,y G V. If G' = {V ,E') 
is a connected induced subgraph of G, then 3(G) > dG'{x, y) — dG{x, y) for each 
x,y G V . Hence dG'{x, y) < d{G)-\-dG{x, y) for each x,y G V . By the generality 
of G' , it follows that G G DH(3(G),+). 

By contradiction, let us suppose that there exists an integer t < 3(G) such 
that G € DH(t, +). Let {x, y} G L){G), and G' = {Pg{x, y)). In this case we have 
that dG'{x, y) = Dg{x, y), and hence the relation Dg{x, y) — dG{x, y) = 3(G) > t 
implies that 

dG'{x,y) = DG{x,y) > t-GdG{x,y). 

Then G ^ DH(t, +), a contradiction. □ 

The following two lemmas list some basic properties of (fc, +)-distance-heredi- 
tary graphs. 

Lemma 1. The following facts hold: 

1. DH(0,+) coincides with the class of distance-hereditary graphs: 

2. DH(fc,+) =DH([/cJ ,+); 

3. DH(fci,+) C DH(/c 2 ,+) for each ki < k 2 ', 

4-. If (x,y) G E then dG{x,y) = 0. Hs a consequence: 

- if 3(G) = 0 then Tt[G) contains every pairs of connected nodes of G; 

- if d{G) > 0 then dG{x,y) > 2 for each pair {x,y} G T(G); 

5. if G contains n nodes, then 3(G) < max{0,n — 4}, and for each n G N there 
exists a graph G' such that 3(G') = n; 

6. DH(fc, +) C DH(1 + |, *) for each k > 1. 
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Fig. 1. A graph G and its twin graph G* . Dashed lines represent edges in (see 
Definition\^. 



Lemma 2. The class DH(fc, +) is closed under taking induced subgraphs. 

Proof. Let G be a graph in DH(fc, +), G' be a connected induced subgraph of 
G, and x,y € G' . Relationships DG>{x,y) < DG{x,y) and dG'{x,y) > dG{x,y) 
are straightforward, whereas DG{x,y) < k + dG{x,y) holds because G belongs 
to DH(fc, +). Hence 

DG'{x,y) < DG{x,y) <k + dG{x,y) <k + dG'{x,y) 

and the theorem follows. If G' is not connected the same argument can be applied 
to every connected component of GL □ 

In the remainder of this section, we provide a characterization of (fc,+)- 
distance-hereditary graphs based on cycle-chord conditions. To this purpose, we 
first introduce the notion of twin of a given graph. 

Definition 3. Let G = (V,E) be a graph. The twin graph of G zs o graph 
G* = (V\G*) such that V* = U and E* = U U E^ , where: 

- V^ = {v^ \ vG V}; 

- V^ = {v'^ \v€ V}; 

- E^ = I (u,v) € E}; 

- E^ = {{u^v^) I {u,v)eE}; 

- E^ = {(u\ ■u^), (u^, v^), I (u, v) G E}. 

Then |R*| = 2- |R| and \E*\ = 4 - iGj. Informally, G* is composed by two “copies” 
of G (corresponding to the subgraphs G^ = (V^,E^) and G^ = (V^,E‘^), both 
isomorphic to G) which are joined by edges in E^ (see Fig.[H where edges in E^ 
are represented by dashed lines). 

The name “twin graph” is due to the fact that, for each pair (v^,v'^) of nodes 
in G* such that vi G and V 2 G V^, v"^ is a. false twin of in G*. Two vertices v 
and v' are twins if they have the same neighborhood; we distinguish between false 
twins when N{v) = N{v') and true twins when N[v] = N[v’]. Let G = {V,E) 
be a graph and u gV. Operation y(G, u) (see |2]) extends G by creating a false 
twin of u. The resulting graph is G' = {V U {u'},E U {{u',v) \ v G Ng{u)}). 
Hence, the twin graph G* can be obtained from G by applying operation y(G, v) 
to each node v of G. 
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Lemma 3. Let G = {V,E) be a graph and let S = (VstEs) be an indueed 
subgraph of G* . There exists an induced subgraph T of G isomorphic to S and 
with no false twins if and only if S has no false twins. 

Proof. Omitted. □ 



Lemma 4. G £ DH(fc, +) if and only if G* £ DH(k, +) . 

Proof. =>; If we assume G* ^ DH(fc, +), then there exist a pair of nodes {u, u} £ 
V* such that Dq* (u, v)—dG* {u, v) > k. Let P*{u, v) = (u = ,v\^ , . . . , = v) 

and p*{u,v) = (u = Ug°,uf^, . . . = v), ij,£j £ {1,2} and 0 < j < m, be the 

longest and the shortest induced paths connecting u and v respectively. Then, 
by Lemma[3l P^ = (uq, . . . , v^) and p^ = (ug, u \, . . . , are also induced 
paths in connecting uj = Vg to . Since Dqi (uJ , — dc^ (mJ , ) > 

|P^| — Ip^I = |P*| — |p*| > k, then G^ ^ DH(fc, +). Finally, since G^ is isomorphic 
to G, also G ^ DH(fc, +). 

By hypothesis G* G DH(/c, +) and since G is an induced subgraph of 
G*, then, by Lemma[21 G £ DH(fc, +). □ 



Lemma 5. Let G be a graph such that d{G) > 0. Then, PiG*) contains a 
cycle-pair of G* . 

Proof. Omitted. □ 



Theorem 2. Let G be a graph and k > 1 an integer. Then, G £ DH(fc,+) if 
and only if cd{Cn) > — 1 for each cycle Gn, n > fc + 4, of G* . 

Proof. By hypothesis, G £ DH(fc,+). By contradiction, let us suppose 
that a cycle G„,n > k -h 4, exists in G* such that cd(Gn) < — 1. Let 

(x,vi,V 2 , . . . ,Vg,y,ui,U 2 , . . . Mp),p+(7+2 = n, be the cycle G„, and {vi,V 2 , . ,Vq} 
the set of nodes giving the chord distance of G„; then, da* {x, y) < cd{Gn) + 1 < 
It follows that, since 

, . , , \ n — k 

DG-{x,y) > n-dG‘{x,y) > n 

then 

7 ^ / \ j / \ n - k n - k 

Dg* [x, y) - dG* [x, y) >n — = k. 

This is a contradiction because, by LemmalU G* £ DH(fc,+). 

<^=; By contradiction, let us suppose G ^ DH(fc,+). By Lemma SI G* ^ 
DH(/c,+). Since fc > 1, then d(G*) > 0. Hence, by Lemma SI and Fact SI of 
Lemma [T] there exists a cycle-pair {x,y} £ V{G*) such that dG»{x,y) > 2 and 
inducing a cycle G„ with n nodes. By contradiction hypothesis, we have that 
Dg* {x, y) > k-\- dG* {x, y). The inequality 

n = Dg* (x, y) + dG* {x, y)> k + dG* (x, y) + dG* {x,y) = k + 2 ■ dG* {x, y) 
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implies that dG‘{x,y) < Moreover, since cd{Cn) = dG‘{x,y) — 1, then 

, n - k 

cd{Cu) ^ ^ 1- 

Since this is a contradiction, then G G DH(fc, +). □ 

4 Recognition Problem for DH(fc, +) 

Although Theorems [Hand [2] provide characterizations for (fc, +)-distance-heredi- 
tary graphs, they cannot be used to devise an efficient algorithm to solve the 
recognition problem for the class DH(A:, +). In this section we study this problem 
when k is not fixed. 

Definition 4. Dilation Number Problem: 

Instance; A graph G = (V, E), an integer q > 0. 

Question; 3(G) > q? 

The NP-completeness of this problem can be shown by providing a polynomial 
transformation from the NP-complete problem Induced Path (cf. [T^, GT23). 

Theorem 3. Dilation Number is NP-complete. 

Proof. Omitted. □ 

If we fix /c = 0, then the recognition problem for the class DH(fc, +) can be 
solved in linear time (H. If we consider k not fixed, then the recognition problem 
for the class DH(fc, +) is exactly the complementary problem of Dilation Number. 
As a consequence, the following complexity result can be stated. 

Corollary 1. If k not fixed, the recognition problem for the class DH(fc,+) is 
Co-NP-complete. 

5 Studying Graphs in DH(1, +) 

In the following we investigate in more detail the smallest class among the new 
ones, i.e., class DH(1,+). 

Lemma 6. Let G be a graph not containing the following graphs as induced 
subgraphs: 

- Hn, for each n > 6; 

- cycles Gq with cd^Go) = 1; 

- cycles Gr with cd{Gr) = 1; 

- cycles Gs with cd{Gs) = 1. 

Then, G does not contain, as induced subgraphs, cycles with n > 6 and 
cd{Gn) < 1. 
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Lemma 7 . Let G be a graph and let G* its twin graph. G £ DH(1,+) if and 
only if the following graphs are not indueed subgraphs of G* : 

i. Hn, for eaeh n > 6; 
ii. cycles Gq with cd{GQ) = 1 ; 
ii. cycles Gr with cd{Gr) = 1; 
iv. cycles G% with cd{Gs) = 1 ; 

V. cycles G2i+4 with cd{C2i+4) = i, for each i > 2 . 

Proof. =>.■ Holes Pin, ri > 6, have dilation number at least 2. Cycles with 6, 
7, or 8 nodes and chord distance 1 have dilation number equal to 2, 3, and 4, 
respectively. Cycles C2i+4 with chord distance equal to i have dilation number at 
least 2* + 4 — 2 • {cd{C 2 i+ 4 ) + 1) = 2. Then they are forbidden induced subgraphs 
for every graph belonging to DH(1,+), and, in particular they are forbidden 
both for G and, by Lemma [H for G*. 

<^=; We prove that if G ^ DH(1,+) then G* contains one of the forbidden 
subgraphs. 

If G ^ DH(1,+) then, by Theorem [S] G* contains a cycle G„, n > 6, as 
induced subgraph such that 0 < cd(G„) < In what follows, let us denote 
cd{Gn) as q. 

If g = 0 then we obtain the holes Pin, n > 6. If 1 < g < and n = 6,7, 8, 
then we obtain the other forbidden subgraphs. 

Now, we have to show that each cycle G„ with n > 9 and chord distance 
g such that 1 < g < contains a forbidden subgraph. In this case, let us 
assume the cycle G„ be induced by the nodes of the two node-disjoint paths 
PG{x,y) = {x,ui,U2,...,Up,y) and pa{x,y) = {x,vi,V2, . . . ,Vq,y) such that 
p -|- g -|- 2 = n. In this cycle we denote by rj the largest index j' such that Vj and 
Uj' are connected by a chord of G„, i.e. rj = max{j' | {vj,Uj>) is a chord of G„}; 
we assume rj undefined when Vj is not incident to a chord of G„. Informally, rj 
gives the rightmost chord incident to Vj. Notice that, since g > 1, fi is defined. 

If ri > 3 then the subgraph of G„ induced by the nodes vi,x,ui, . . . , is a 
cycle with at least 6 nodes and chord distance at most 1. According to LemmaEI 
this subgraph contains one subgraph among those listed in items from (i.) to (iv.) 
of the statement. 

In the remainder of this proof we assume r\ < 3. Since ri is defined, let G„/ 
be the subgraph of G„ induced by the nodes vi,V2, ... ,Vq,y,Up, Up-i , . . . , 11^. 
Informally, G„/ is one of the two cycles obtained by “cutting” G„ by means of 
chord {vi,Uri). Cycle G„/ has n' > n — 3 nodes (because ri < 3) and chord 
distance at most g — 1. 

Now we show that either G„ coincides with the graph listed in item (v.) of 
the statement, or G„' is forbidden. In the latter case we can recursively apply 
to Gn' this proof. 

According to Theorem [21 Gn' is forbidden when cd{Gn') < Since 

cd(Gn') < g — I, this corresponds to show that > g — I holds; and, since 
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Fig. 2. The clepsydra graph cl(k) (see Definitions^ . The dashed edges may or may not 
exist. 

n' > n — 3, in turns it corresponds to show that > g — 1 holds. This 

relation is equivalent to the following inequality: 

n > 2g + 5. (1) 

Since we are considering a chord distance q such that 1 < g < then n > 
2q + 4. This implies that Eqn. [T] trivially holds when n > 2g + 5, and it remains 
to be analyzed the case n = 2g + 4. In this case the cycle C„ under consideration 
has size n = 2g + 4 and chord distance cd(C'„) = g: this means that is a 
forbidden graph (see item (v.) of the statement). This concludes the proof. □ 

The following definition introduces the notion of clepsydra graph (see Fig.|2]). 

Definition 5. Let Cg = (^, itg, M 4 , rtg) be a eycle such that cd(C'^) < 1 
and deg{u'^) = 2, Pk = (vi,V 2 , ■ ■ ■ ,Vk) be a path with k > 1, and C'l = 
(u", ^ 2 , U 3 , U 4 , Mg) be a cycle such that cd(Cg) < 1 and deg{u'{) = 2. A clepsydra 
of order k is a graph cl{k) = {V, E) such that: 

- V = {m'i, M2, M3, M4, Ml, M2, . . . , Mfe, M2 , M3, M4, Mg }; 

— ifu,v€V then (m,m) G E if and only if one of the following condition holds: 

1 . (m,m) G {(Mi,Mi),(M4,Mi),(Mfe,M2),(Mfc,Mg)} 

2. u and v are adjacent in Cg, Pk, or C'f 

Notice that, since there are 3 possible cycles Cs having chord distance at 
most 1, then for a fixed k there exist 9 possible clepsydrae cl{k). 

Lemma 8. Let Cn, n > A + 2i and i > 1, be a cycle having chord distance 
cd{Cn) < i- If Cn G DH(|,*) then Cn contains a clepsydra as induced subgraph. 

Proof. Let (x,Vi,V 2 , . ■ . ,Vq,y,Up, Mp_i, . . . , Mi) be the cycle Cn, and let (mi, M2, 

. . . , Vq) be the nodes giving the chord distance of Cn- From hypothesis, it follows 
that p+g + 2>4 + 2i and q < i. From these two inequalities, we get a lower 
bound for p: 

p>2i + 2 — q>2i + 2 — i = i + 2. 
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According to the definition of chord distance, it follows that: 



scA^,y) > 



p+l ^ i + 3 
i + 1 ~ i+1 



1 + 



2 

i + l 



This observation implies that, if j = 1 or i = 2 then sc„.{x,y) > § and hence 
Cn ^ DH(|,*). We prove the theorem for i > 3 by showing that either 
contains a clepsydra or Cn contains a cycle such that n' > 4 + 2{i — 1) and 
cd{Cn') = i — 1 (and hence we can recursively apply to Cn' this proof). 

In the remainder of the proof, if 1 < J < g then we denote by Vj the 
largest index j' such that Vj and Uj> are connected by a chord of Cn, i.e. 
Tj = max{/ I (vj,Uj') is a chord of C„}; we assume rj undefined when Vj is 
not incident to a chord of Cn- Informally, rj gives the rightmost chord incident 
to Vj. In a similar way, Ij — min{j' | (vj, Uj>) is a chord of Cn} gives the leftmost 
chord incident to Vj. By definition of chord distance, h and Iq are both defined; 
Ij, 1 < j < q, may or may not be defined. 

Now, assuming Ij, 1 < j < q, defined, let Cn' and Cn" be the subgraphs of 
Cn induced by the nodes x,vi,V2, ■ ■ ■ , Vj,ui - , . . . , m and Vj,Vj+i, . .. ,Vq,y, 

Up,Up-i, . . . ,ui-, respectively. Informally, Cn' and Cn" are the cycles obtained 
by “cutting” Cn by means of chord (vj,ui.). 

Cycle Cn' has n' = Ij + j + ^ nodes; as a consequence, cycle Cn" has 



n" = n — n' + 2 > 4: + 2i — {I j + j + 1) + 2 = 2i + b — I j — j 



nodes. Moreover, it is easy to see that cd{Cn') < J — 1 and cd{Cn") < q — j + 1- 

We may assume that n' < 4 + 2(j — 1), otherwise n' > 4 + 2(j — 1) and 
cd(Cn') < j — 1 imply that we can recursively apply this proof to Cn' - Similarly, 
we may assume that n" < 4 + 2{q — j + 1), otherwise n" > 4 + 2{q — j + 1) 
and cd{Cn") < 9 — j + 1 imply that we can recursively apply this proof to Cn"- 
Inequality n' < 4 + 2(j — 1) can be rewritten as Ij + j + 1 < 4 + 2(j — 1), from 
which Ij < J ' + 1 and hence Ij < j follows. Similarly, from n" > 2i + 5 — Ij — j and 
n" < 4 + 2(g — j + 1) we get Ij > 2{i — q)+j. Relations Ij < j and Ij > 2{i — q)+j 
hold only if Ij = j. 

Now, if V 2 is not incident to a chord, let k = min{j' | Iji is defined}. Since Iq 
is defined, it follows 3 < k < q. Now, let Cn'" be the subgraphs of Cn induced 
by the nodes x,vi,V 2 , - - - . . . ,ui- Since h = k > 3, cycle Cn'" 

has n'” = 2fc + 1 >7 nodes and chord distance equal to 1. This contradicts 
Cn G DH(|, *) since s{Cn'") > |. 

As a consequence, I 2 must be defined and, since Ij = j, then I 2 = 2. Chord 
(v 2 ,U 2 ) creates the cycle C 5 = {x,vi,V 2 ,U 2 ,ui) having chord distance at most 1 
(possibly, chords incident to vi). 

Symmetrically, the arguments above can be applied to show that Vq-i must 
be defined and that Vq-i = p — 1. Hence, chord (uq_i,Ur,_i) creates the cycle 
^5 = {vq-i,yq,y,Up,Up-i) having chord distance at most one (chord possibly 
incident to Vq). Finally, cycles C 5 and C'f along with path {v 2 ,v^, . . . ,Vq — 1) 
form the requested clepsydra. □ 

In [7] the authors proved that C G DH(|, *) if and only if G does not contain 
graphs listed in items from (i.) to (iv.) of Theorem [7| as induced subgraphs. 



76 



Serafino Cicerone, Gianluca D’Ermiliis, and Gabriele Di Stefano 



As a consequence, we can characterize any graph G in DH(1,+) in terms of 
the following forbidden subgraphs of G*: clepsydra and forbidden subgraphs of 
DH(|,*). Since none of these subgraphs have false twins, by Lemma 0 we can 
characterize graphs in DH(1,+) without referring to twin graphs: 

Theorem 4. Let G be a graph. G G DH(1, +) if and only if the following graphs 
are not indueed subgraphs of G: 

i. Hn, for eaeh n > 6; 
ii. cycles Gq with cd{Ge) = 1; 

Hi. cycles Gr with cd{Gr) = 1; 

iv. cycles Cs with cd{Gs) = 1 or cd{Gs) = 2; 

V. clepsydrae. 

This theorem give us a basis to device a polynomial algorithm for the recog- 
nition of graphs in DH(1, -b). By Lemma[T] we know that DH(1, -b) C DH(|, *), 
and comparing the characterization of graphs in DH(|, *) recalled above and 2] 
it follows that a graph G belongs to DH(1, -b) if it belongs to DH(|, *) and does 
not contain a clepsydra as induced subgraph. In [7], the authors give a poly- 
nomial algorithm for the recognizing whether a graph G belongs to DH(|,*); 
as a consequence, for our purposes it is sufficient to devise a polynomial time 
algorithm to check whether G contains a clepsydra as induced subgraph. Here 
is a brute force algorithm. 

For each connected component of G, find all the cycles G 5 with cd(G 5 ) < 1. 
Then, consider a pair of cycles (G5, G'f) with at most one node in common, and 
two nodes Vi G G 5 and Vk £ G'f both with degree 2 in Gg and G" respectively. 
Consider the induced subgraph G' of G obtained by removing all the nodes in 
Gg and Gg, but vi and Vk. If vi and Vk are connected in G', then G contains a 
clepsydra as induced subgraph. If necessary, repeat the above steps for all the 
possible pairs of cycles and nodes vi and Vk. 

As regards the time complexity of the algorithm, it is obviously polynomial. 
The correctness, not given here, is obtained by proving that either Gg, G" , and 
the path connecting vi and Vk in G' induce a clepsydra or, otherwise, there exists 
another clepsydra in G. 

Theorem 5. The recognition problem for the class DH(l,-b) can be solved in 
polynomial time. 

6 Conclusions 

In this paper we have introduced the (fc, -b)-distance-hereditary graphs, a para- 
metric extension of the class of distance-hereditary graphs. These graphs can 
model communication networks having desirable connectivity properties. 

In spite of the results provided in this work, many problems are left open. 
Among the most relevant, what is the largest constant k such that the recogni- 
tion problem for DH(fc, -b) can be solved in polynomial time? Moreover, several 
algorithmic problems are solvable in polynomial time for DH(0, -b). Can some of 
these results be extended to DH(A:,-b), k > 01 
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Abstract. Treewidth is generally regarded as one of the most useful pa- 
rameterizations of a graph’s construction. Clique-width is a similar pa- 
rameterizations that shares one of the powerful properties of treewidth, 
namely: if a graph is of bounded treewidth (or clique- width), then there 
is a polynomial time algorithm for any graph problem expressible in 
Monadic Second Order Logic, using quantihers on vertices (in the case of 
clique- width you must assume a clique- width parse expression is given). 
In studying the relationship between treewidth and clique-width, Cour- 
celle and Olariu showed that any graph of bounded treewidth is also of 
bounded clique-width; in particular, for any graph G with treewidth k, 
the clique-width of G < 4 * 2*’“^ -|- 1. 

In this paper, we improve this result to the clique-width of G < 3 * 2*’“^ 
and more importantly show that there is an exponential lower bound 
on this relationship. In particular, for any k, there is a graph G with 
treewidth = k where the clique- width of G > 2^*’/^!“^. 



1 Introduction 

One of the most fruitful graph theoretical developments of the last few decades 
has been the concept of treewidth, pioneered by Robertson and Seymour. Loosely 
speaking, the treewidth of a graph captures a way of constructing the graph in a 
“tree like” fashion. The lower the treewidth of a graph, the closer it is to being a 
tree (connected treewidth 1 graphs are precisely trees) . One of the major results 
in this area is that any problem expressible in Monadic Second Order Logic 
(which includes many NP-complete graph problems) when restricted to graphs 
of bounded treewidth k, has a linear time algorithm (albeit with a constant that 
grows exponentially with k). Although this result is very far reaching, it is still 
somewhat dissatisfying since many classes of “tame” graphs, for example cliques, 
have arbitrarily high treewidth, yet have simple linear time algorithms for most 
of the problems mentioned above. 

The clique-width of a graph is another attempt to parameterize the con- 
struction of a graph so that sweeping claims can be made about the graph’s 
tractability for polynomial time solutions to difficult problems. 
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The clique- width of a graph G, denoted by cwd{G), is defined as the minimum 
number of labels needed to construct G, using the four graph operations: creation 
of a new vertex v with label i (denoted i{v)), disjoint union (©), connecting 
vertices with specified labels (ry) and renaming labels (p). The construction of a 
graph G using the above four operation is represented by an algebraic expression 
called a k-expression where k is the number of labels used in the expression. More 
details are given in section This notion was first introduced by Courcelle, 
Engelfriet and Rozenberg in and has been studied extensively in recent years. 

For example, cographs (graphs with no induced P4S) are exactly the graphs 
of clique-width at most 2, and trees have clique- width at most 3, 0. P^-sparse 
(every 5 vertices have at most one P4) and P4-tidy graphs (no induced P4 has 
more than one partner, where a partner is a vertex whose inclusion in the P4 
results in at least two distinct P4S) have clique- width at most 4, |Z]. The (g,g-4) 
graphs for q > 4 and {q, 9 — 3) graphs for g > 7 have clique-width at most q 
|7I14| . A (q,t) graph is a graph in which every subgraph induced by q vertices 
contains at most t induced Pr’s. 

As mentioned above, the motivation for studying clique-width is analogous 
to that of treewidth. In particular, problems defined by Monadic Second Order 
Logic formulas, using quantifiers on vertices but not on edges, can be solved in 
polynomial time on any class of graphs C of clique- width at most k, for some 
fixed k, assuming that the /c-expression defining the input graph is given. For 
details, cf. m- In addition, polynomial time algorithms can be obtained for 
other problems such as chromatic number, edge dominating set jl3J and IDq- 
partition problems [TO], on any class of graphs C of clique- width at most k, for 
some fixed k, assuming that the fc-expression defining the input graph is given. 

This raises the obvious question about the relationship between treewidth 
and clique-width. Courcelle and Olariu proved the following theorem. 

Theorem 1 ([S|). If the treewidth of G is k, then cwd{G) < 2^+^ -|- 1 (= 4 + 
2'=-! -h 1). 

This theorem guarantees that any class of graphs of bounded treewidth is also 
of bounded clique-width and thus, from the perspective of algorithmic tractabil- 
ity, clique- width is a more powerful concept. Note that since the clique- width of 
a clique on n vertices is at most 2 and its treewidth is n — 1, the gap between a 
graph’s treewidth and clique- width can be arbitrarily high. The above theorem 
also raises the questions of whether the bound can be improved (note that for 
trees, k = 1 and the theorem guarantees that the clique-width of a tree is < 5, 
whereas it is known that the clique width of a tree is < 3) and more importantly, 
whether the exponential bound represented in the theorem can be replaced by 
a polynomial bound. To make this more precise, we let Qk = {G : twd{G) = k} 
where twd{G) denotes the treewidth of G, and want to determine whether f{Gk) 
= max{cwd{G) : G S Qk} can grow polynomially with k. 

In answer to these two questions, our paper proves the following two theo- 



rems: 
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Theorem 2. f{Gk) > (i.e., for any k, there is a graph G where 

twd{G) = k and cwd{G) > 



Theorem 3. If the treewidth of G is k, then cwd{G) <3*2^ 

A notion similar to clique-width is the notion of NLC-width. The NLC- 
width of a graph G, denoted as NLGwidth{G), is the minimum number of 
labels needed to construct G using two graph operations called union (X 5 ) and 
relabeling (o^), see [T3] for details. For every graph G, NLCwidth{G) < cwd{G) 
and cwd{G) < 2 x NLCwidth(G) [12]. If G is of treewidth at most k then 
N LGwidth(G) <4* 2^“^ — 1, [15]. Using these results, Johansson [12] showed 
that if G is of treewidth at most k then cwd{G) < 4 * 2*^“^, thereby slightly 
tightening the Courcelle-Olariu bound. 

By the relationship between clique-width and NLC-width mentioned above 
we obtain from Theorem[^that for every k, there is a graph G, where twd(G) = k 
and NLCwidth(G) > Similarly, we obtain from Theorem [3] that if the 

treewidth of G is k, then N LGwidth{G) < 3 * 2^“^. 

Overview of the paper: 



The second section of the paper introduces the notation and definitions used 
throughout the paper. In particular, we present the concept of fc-trees and partial 
A:-trees. (Note that partial fc-trees encompass an alternative definition of graphs 
having treewidth k.) We also describe how to construct the fc-trees that will be 
used in the proofs of the two theorems. Sections El and S] present overviews of 
the proofs of Theorem 12] and 0 respectively. The paper ends with concluding 
remarks and directions for further research. 



2 Notation and Definitions 

The graphs we consider in this paper are undirected and loop-free. For a graph 
G and a vertex u of G, we denote by Ng{u) the neighborhood of u in G, which 
is the set of all vertices in G which are adjacent to u. We denote by Ng[u] the 
closed neighborhood of m in G which is equal to Ng{u) U {u}. 

We first give more details on the definition of clique-width presented above. 
The clique-width of a graph G, denoted by cwd{G), is defined as the minimum 
number of labels needed to construct G, using the four graph operations: creation 
of a new vertex v with label i (denoted f(r’)), disjoint union (©), connecting 
vertices with specified labels (rj) and renaming labels (p). The operation r]ij 
{i j) adds all edges (that are not already present) between every vertex of 
label i and every vertex of label j. The operation Pi^j renames all vertices of 
label i with label j. An expression built from the above four operations using 
fc labels is called a k-expression. Each fc-expression t uniquely defines a labeled 
graph valft) where the labels are integers 1 , . . . , fc associated with the vertices 
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and each vertex has exactly one label. We say that a fc-expression t defines a 
graph G if G is equal to the graph obtained from the labeled graph val{t) after 
removing its labels. The clique-width of a graph G is equal to the minimum k 
such that there exists a ^-expression defining G. 

For a fc-expression t defining a graph G, we denote by tree{t) the parse tree 
constructed from t in the usual way. The leaves of this tree are the vertices of G 
with their initial labels, and the internal nodes correspond to the operations of 
t and can be either binary corresponding to ©, or unary corresponding to rj or 
p.ll k = cwd{G) we say that t is a clique-width expression of G and we say that 
treeit) is a clique-width parse tree for G. For a vertex rt of G and an internal 
node a of tree{t), such that u occurs (as a leaf) in the subtree of tree{t) rooted 
at a, the label of u at a is defined as the label that u has immediately before the 
operation a is applied on the subtree of treeft) rooted at a. 

As mentioned above, G being a partial k-tree is equivalent to G having 
treewidth at most k. We now define fc-trees and present notation for the A:-tree 
that will be used to establish the lower bound presented in Section El 

We denote by Kk the clique with k vertices. For a vertex v and a set of 
vertices S we say that v is universal to S' if u is adjacent to all the vertices in S. 
A graph G is a k-tree if G is either a Kk or is formed from a fc-tree G' , by adding 
a new vertex that is universal to a Kk in G'. Note that 1-trees are precisely trees. 
G is a partial k-tree (i.e., twd{G) < fc) if G is a partial subgraph of a fc-tree. 
This recursive definition of fc-trees immediately suggests that the construction 
of a given fc-tree G can be represented by a labeled tree Tq. To illustrate this 
concept, we now describe F, the fc-tree that will be used to establish the lower 
bound and the construction tree Tp corresponding to F. In this and in the next 
section we denote Tp by T. 

We let the initial Kk be on vertices {1, 2, • • • , fc}. When a vertex x is added 
to an existing fc-tree, we let Sx U {x'} denote the Kk to which x is universal. The 
vertex x' is called the parent of x, represented by p{x). Sx, the remaining vertices 
in the Kk, are called the sources of x. In the construction tree T, x becomes a 
child of the vertex x' , and Sx is the label associated with x. To “start” the tree 
T, we let fc be the root of T. v, the first vertex added to F is a child of fc and 
Sv = {1,2,-- - ,fc— 1}. To continue the construction of F (as represented by T), 
we let V have fc children Vi,V 2 , - ■ ■ ,Vk where Sy. = {Sy U {fc}) — (ij. Thus Vi is 
adjacent to the Kk formed on {1, • • • , * — 1, i -|- 1, • • • ,k,v}. Often we will denote 
Vi by “—i”, thereby indicating that the vertex i in SyUp{v) has been “replaced” 
by V, the parent of Vi. In this case we say that Vi is the —i child of its parent. 

Note that this process of giving any node inT k children can continue indefi- 
nitely. (In particular, node x has children xi,X 2 , ■ ■ ■ ,Xk where Sxi = (5'2,Up(a;)) — 
the ith element of Sx \Jp{x).) For x a node of T, we let Tx denote the subtree of 
T rooted at x and let Lf. (the hth level ofTx) denote the nodes of Tx at distance 
h from X. 

F, the graph used in Sectional consists of fc and v as described above as 
well as Ty consisting of the complete fc-ary tree up to level a = (2fc-|- 1)2L'^/^J+^. 
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Thus all leaves of T are in Note that each node of T corresponds to a vertex 
of 



8 (- 2 ) 




Fig. 1. The beginning of the construction tree T for the graph F when k = 3. 



Example 1. Figure [IJillustrates the beginning of the construction tree T for the 
graph F when k = 3. The root of this tree is k. v, the first vertex added to F is 
vertex 4 and is the child of the root. The first level of the tree T„ is = {5, 6, 7} 
and the second level of T„ is = {8, 9, • • • , 16}. We omit from the figure the 
labels Su of the nodes of the tree. Instead, we employ a (-i) label for a node u 
indicating that u is the —i child of its parent. For example, node 12 is the —3 
child of node 6 which is the —2 child of node 4. The set of sources of vertex 12 
is Si 2 = {1,4}. The parent of vertex 12 is p(12) = 6. Figure E] illustrates the 
graph corresponding to the construction tree T of Figure[H Note that the graph 
of Figure Elis the subgraph of the graph F ioi k = 3, induced by the vertices 
{1,2,... ,16}. 

For X a vertex of F, such that x ^ (1, • • • ,k} we say that is a complete 
tree if is not empty (i.e., the distance in T between x and v is at most 
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Fig. 2. The graph corresponding to the construction tree of Figure [T) 



a — (2k + 1).) We now state, without proofs, some properties of complete trees 
that will be used in subsequent arguments. 

Claim 1 Let T„ be a complete tree in T. Then for every R C Su, there exists 
vertex r^ G fl such that C\ Su = R- Furthermore we can choose 

these vertices such that the set TZ = {r/j : R C Su} — {u} is an independent set. 

Corollary 1. Let T„ be a complete tree in T. Then for any Q C Su with qi,q 2 G 
Q, there exist vertices u\,U 2 G Tu, at level at most 2A: -|- 1 ofTu, such that u\ 
is neither an ancestor nor a descendent of U 2 , Sui = Q — { 9 i} U where 
S'u. n Q = 0, t = 1, 2, all vertices of S'u. are in Tu and S'u^ fl S'u^ = 0. 

For X € T we let denote a path (x = xq,xi, ■ ■ ■ , xp) of T^ where xp is a leaf 
and : 

^Xi — Sxi_i^ i > 0 (★) 

- p(xi) = Xi-i, i > 0. 

For any vertex x £ T where T^ is a complete tree and S '£ Sx we let: = 

{y ■ y £ ASyC\Sx = 5} (i.e., denotes the set of vertices at level 2fc -I- 1 

that have S as the only Sx vertices in their sources), and Px = {Py ■ y £ X^} 
(i.e., Px denotes the paths rooted at y £ X^ that satisfy (*) above). 
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Claim 2 |X|| > (fc - |S'|)'=+i . 

In order to argue about the clique-width of F, we let T denote a clique-width 
expression tree for F. For a an internal node of T, we let Ta denote the subtree 
of T rooted at a and let Va represent the leaves (i.e., vertices of F) of 7^. As the 
following claim shows, having a set of vertices outside Va together with certain 
neighbours inside Va establishes a lower bound on the number of distinct labels 
required at a (i.e., to label the vertices of Va at node a of T). 

Claim 3 Let X be a set of I vertices in V — Va where for each subset of these 
vertices, there is a vertex inside Va adjacent to all vertices of this subset of X 
and to no other vertices in X . Then at least 2* labels are required to label the 
vertices of Va ■ 

For 7i a subtree of T and Py a path of T satisfying (*), we say that Py is 
full with respect to 7i if all vertices of Py are in 7i. Given a P^ we will often 
choose a lowest possible internal node of T, say a, such that some path in P^ is 
full with respect to Ta- (i-e., for some path P G P^ all of its vertices are in Va] 
P is said to be full with respect to a.) If fl 14 = 0, for some u, we say that 
is empty with respect to a. 

The following claim is the key to the lower bound result. First, it is the 
bottleneck for improving our lower bound; in particular, is the largest 

value for which we could find a proof and any improvement in this value would 
be a major step towards achieving a better lower bound. Second, the claim is the 
basis for the inductive proof of the lower bound. Roughly speaking, the proof 
first establishes a vertex o in T such that S*, a small set of vertices of F all 
must have unique labels at a. These unique labels are required because of the full 
path with respect to a. Now, the existance of a subtree of T that is empty with 
respect to a allows other vertices of T to be identified so that the claim may be 
reused. By doing so, a new vertex is added to S* and the process of incrementing 
S* by one vertex at a time continues until the lower bound is surpassed. 

Claim 4 Assume cwd{F) < Let x be a node in T where is a com- 

plete tree, let S be a strict subset of Sx, and let a be a lowest node in T such 
that some path in P^ is full with respect to a. Then there is a y G X^ such that 
Ty is empty with respect to a. 



3 Lower Bound 

The purpose of this section is to give an overview of the techniques used to 
establish the lower bound stated in Theorem [3 

Theorem [3 /(C/fc) > 

In particular we will examine the fc-tree F defined in Section [3 and proceed by 
assuming cwd{F) < 
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We let T be a clique-width expression tree for F . Initially we will examine 
the vertices in and let a be a lowest node in T such that some path in 

-f?’ i® full with respect to a. We will then show that at least [fc/2j vertices 

of {2, - ■ ■ ,k — 1} are in W and have unique labels at a. We then proceed by 
induction by identifying other vertices in T and looking at b, a lowest node in T 
such that some path rooted at one of these vertices is full with respect to b. We 
show that Va Q Vh and that the set of vertices requiring unique labels at a also 
requires unique labels at b and that this set must be augmented by a new vertex 
that requires a new label. This augmentation continues until eventually this set 
has cardinality equal to thereby proving the theorem, a, the depth of 

T is chosen so that all subtrees used to generate full paths, are complete and 
have depth at least +2k + \. The length of the full paths generated by 

these subtrees will be at least 2 , since the root of each such path is at level 

2fc -I- 1 of its corresponding subtree. 

Following the above outline, we start by stating some claims about 7^. 

Claim 5 At least \ k/2\ vertices of Sy — {1} (i.e., {2, • • • ,k — l}) are in Va and 
each must have a unique label at a. 



We now prepare for the general induction step, each execution of which will 
add one more vertex requiring a new unique label. Thus executing the induction 
step — \k/2\ times will complete the proof of the theorem. First we 

let S* denote a subset of Sy — {1} such that all elements of S* are in Va and 
[S’*! = \k/2\ . (Note that there may be more elements of Sy — {1} that are in Va 
but we only consider \k/2\ of them.) By Claim |5] the set S* exists and all the 
vertices of S* have different labels at a. 

Let M be a vertex in Xy ‘ where is empty with respect to a. {u is 
guaranteed by Claim 01) Let si and S 2 be arbitrary vertices in S*. By Corollary 
[1] Ty contains vertices u\ and U 2 , at level at most 2fc -I- 1 of T^, such that ui is 
neither an ancestor nor a descendant of U2 and: 

- Su, = S*^ US'yX=l,2 where S*. = S* - {sj 

- S'y.(lS*'=^ i = l,2 

- = 0 . 

Note that [S'*. | = [fc/2j — 1. Let m' be an arbitrary element in Sy. for i = 1,2 

and set U = {ui,M2}. Clearly S* = ■ Ui G F}. Let Pu = {JiPuf' : 

Ui G U} and define 6 to be a lowest vertex in T such that some path in Pu is 
full with respect to b. 

Claim 6 b is an ancestor of a. 



Claim 7 For all Ui G U , there is a y G Xyf' such that Ty is empty with 
respect to b. 
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Claim 8 All S* vertices have different labels at b. 

Let ui € U satisfy the existence of a vertex z € xffy such that z is the 
root of a full path with respect to b. 

Claim 9 There are at least [k/2\ vertices of Su^ ~ {^i} that are in 14 . 

Since = [k/2\ — 1, there is a vertex vf G — {u'^} that is in I 4 . 

Claim 10 At b, the label of uf is different than the labels needed for the S* 
vertices. 



Thus this step has shown that at least \k/2\ +1 distinct labels are required 
at b. To continue the process, we will augment the set U and define c to be a 
lowest vertex in T such that some path in Pu is full with respect to c. It will then 
be shown that c is an ancestor of b and that all vertices in S* must have different 
labels at c. A new vertex will be identified that must have a different label at 
c than all vertices in S* and it will be added to S* . This process continues by 
renaming c to be 6 and augmenting U . For augmenting U we shall use the two 
vertices and u \,2 guaranteed by the following claim. 

Claim 11 Let s be an arbitrary vertex in S*_^ . Then there exists vertices iti_i 
and ui ^2 in Tui such that u\^i is neither an ancestor nor a descendant of ui ^2 
and: 



'S'ui,! 









Augmenting U\ U\ (a vertex in U such that there is z G where z is 

the root of a full path with respect to b) will be replaced by the vertices u\^i 
and ui ^2 satisfying the conditions defined in Claim [TT] As before, we let u[ i and 
u'l 2 be arbitrary elements in ^ and 5'^^ ^ respectively. After reindexing the 
elements of ?7 to be mi, M 2 , • • • , we define: 

- S* = Ul-S*;, : M, G C/} and 

- Pu = [j{Pu:'~^"'^ ■■u.eU}. 

We now state various claims about U. 

Claim 12 The following hold for U : 

1. No vertex in U is an ancestor (in T) of any other vertex in U . 

2. Vm €U,Su = SlA S'^ where |5:| = \k/2\ - 1. 

3. Vm, w distinct elements of U, S'^D = $. 

4-. Vm G U , there is a vertex y G ^ such that Ty is empty with respect to 

b. 
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5. All vertices in S* have different labels at b. 

We now define c to be a lowest vertex in T such that some path in Pjj is full 
with respect to c. (Note that c could be b.) In order for the process to continue, it 
can be shown that analogues (where b is replaced by c) to Claims 0 [T] [S] E] and 
[To] hold. Where appropriate, the proofs use the facts established in Claim [TJ] 

Finally, we rename c to be 6 and again augment U by replacing u\ with 
the vertices ui,i and u \^2 guaranteed by Claim fTTl This augmentation continues 
until [S'*! > thereby completing the proof of Theorem [2] Note that for 

these augmentation steps to be valid, for every vertex u in U the depth of the 
tree T„ must be at least 2fc -|- 1 -I- 2L^^^J . This is assured by our initial choice of 
a, the depth of Ty having been set to (2k + 1)2L^/^J+^. 

4 Upper Bound 

We now turn our attention to establishing the upper bound stated in Theorem |2| 

Theorem [S]// the treewidth of G is k, then cwd(G) < 3 * 2^“^. 

To prove this theorem, we will first prove it for G a fc-tree and then indicate 
the modifications for partial fc-trees. We start by developing notation for G, 
consistent with that presented in Section We then present an algorithm that 
is guaranteed to generate a clique- width expression tree that uses at most 3*2^“^ 
labels. It should be clear to the reader that this algorithm is an extension of the 
standard algorithm that shows that the clique-width of a tree (i.e., a 1-tree) is 
at most 3. 

Given G, let T be a construction tree for it and let x be an arbitrary vertex 
in T where Sx = {1, 2, • • • , fc — 1} and p(x) = k. x can be the root of many 
subtrees of the construction tree of G. Each child of x will be a —i node for some 
i S {1,2,--- ,k}. (i.e., such a node will be universal to the fc-clique consisting 
of {1, 2, • • • ,k} — {i} U {x}.) Note that the subtree rooted at a —i child of x 
could contain vertices universal to any subset of {1, 2, • • • ,k} — {z} U {a;}. To 
handle this situation, the recursive labeling of the subtree will assign up to 2^ 
labels to the vertices of the subtree, thereby allowing all possible connections 
into {1, 2, • • • ,k} — {i} U {a:}. 

The Algorithm: 



The algorithm will proceed by bottom-up dynamic programming on T. At each 
stage it will assume that the subtrees whose roots are children of a particular 
X have been labeled using at most 3 * 2^“^ labels and that each vertex in the 
subtree rooted at a —i child of x is labeled with one of 2^ labels indicating the 
subset of {1, 2, • • • , A:} — {i}U{a;} to which it is universal. Clearly this assumption 
holds for the leaves of T. 

The algorithm, using at most 3 * 2*“^ labels, will label the vertices of U 
so that each vertex in Tx ends with one of 2^ labels indicating the subset of 
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{1, 2, • • • , k} to which it is adjacent. For 1 < i < fc, let denote a —i child of 

X. Let 7i = {Ty^ : Ui is a, —i child of x}. All vertices in adjacent to the same 

subset of {1, 2, • • • , fc} — {i} U {a;} will be assumed to have the same label. 

We now describe the steps of the algorithm for labeling the vertices in . 

1. Construct a new union node with the two children being the expression tree 
for 7j and x. x is given a new label, so we now have 2^ + 1 labels being 
used. The labels of vertices that are going to be involved in subsequent rj 
operations are considered to be “active” . Thus at this stage we have 2^ + 1 
“active” labels consisting of the label of x and the labels corresponding to 
all subsets of {2, 3, • • • , k, x}. Now we add all edges to x and COLLAPSE 
label class X U {a;} into X for each X C {2, 3, • • • ,k}. Thus we have 2^“^ + 1 
“active” labels. We let T' denote the current expression tree. 

2. For i from 2 to fc consider the subtrees rooted at the —i children of x: For 
each i, we augment T' by adding a new union node with the two children 
being the expression tree for 7i and T' . We make sure via p operations 
that the vertices in the expression tree for Ti that are adjacent to subsets 
of {i + 1,-- - ,fc} will have the same label as the vertices in T' which are 
adjacent to the same subsets of {i+ 1, • • • , k}. Any vertices of Tj adjacent to 
{ 1 , • • • , i — 1 } union a subset of{i+l,---A:,a:} are given new “active” labels . 
Now we add all edges to x and COLLAPSE label class X U {x} into X for 
each A C {1, • • • , fc} — {zj. 

The algorithm ensures that vertices in different subtrees adjacent to the same 
subset of {1, 2, • • • , fc} will have the same label. (For example, if fc = 4, then step 1 
ensures that x which is adjacent to {1, 2, 3, 4} has an “active” label; furthermore, 
in this step all vertices adjacent to each X C {2, 3, 4} have “active” labels. When 
z = 2 in step 2, vertices adjacent to any of the subsets: {!}, {1, 3}, {1,4}, (1, 3,4} 
will be given “active” labels. When z = 3, the subsets (1, 2}, (1, 2,4} are added 
and when z = 4, the subset {1, 2, 3} is added.) The COLLAPSE operation ensures 
that once an edge has been added from a vertex y to x, the label of y is reset 
to be that of X where X U {x} is the subset of (1, 2, • • • , fc} U {cc} to which y is 
adjacent. 

To prove the theorem, we show that the above algorithm constructs G and 
uses no more than 3 * labels. The fact that G is constructed follows by 
a straightforward induction argument. The proof that the number of labels is 
bounded by 3 * 2^“^ is contained in the journal version of the paper. 

5 Concluding Remarks 

The most important result in this paper is the fact that the clique-width of a 
graph can be exponentially higher than its treewidth. We fully expect that the 
bound expressed in Theorem [2] can be improved; the most interesting question, 
however, is to find a lower bound that confirms the bound achieved through some 
algorithm. In particular, we expect that the upper bound expressed in Theorem 
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ID is the best possible. Note that it is correct for fc = 1 (trees) and for fc = 2 
(shown by a tedious exhaustive argument). 

It is well appreciated that the most important open problems in the study 
of clique-width are the resolution of the general recognition problem (given G 
and integer k, is cwd{G) < k - strongly believed to be NP-complete) and the 
fixed k recognition problem for k > 4. (The fixed k recognition problem can be 
solved in polynomial time for fc < 3, in particular the case when k = 1 is triv- 
ial, for k = 2 these are the cographs which can be recognized in linear time 
1 ^ and for fc = 3 an 0{n^m) algorithm is presented in [^). It is interesting to 
note that the corresponding problems for treewidth are resolved. In particular, 
Arnborg, Cornell and Proskurowski [I] showed that the general treewidth recog- 
nition problem (given G and integer k, is twd(G) < k) is NP-complete whereas 
the fixed recognition problem is in P. Bodlaender | 2 ] later presented a linear time 
algorithm for the fixed k treewidth recognition problem. 

It seems as though the stumbling block for both problems is the difficulty in 
developing good arguments to provide strong lower bounds on the clique-width 
of a graph. Hopefully the techniques presented in Section E] can assist. 

Another avenue of promising research is the development of polynomial time 
algorithms to determine the clique-width of restricted families of graphs, espe- 
cially those where the clique-width can be arbitrarily large. Such families include 
permutation graphs (even bipartite permutation graphs E), planar graphs and 
interval graphs (even unit interval graphs) jllj . Again progress in this area de- 
pends on lower bound arguments that show that the clique-width achieved by 
a particular algorithm is the best possible. It is interesting to note that it is an 
open and seemingly difficult problem to find a clique- width algorithm for A:-trees, 
even for k = 2. 
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Abstract. The 2-level cactus introduced by Dinitz and Nutov in |5] is 
a data structure that represents the minimum and minimum+1 edge- 
cuts of an undirected connected multi-graph G in a compact way. In 
this paper, we study planarity of the 2-level cactus, which can be used, 
e.g., in graph drawing. We give a new sufficient planarity criterion in 
terms of projection paths over a spanning subtree of a graph. Using this 
criterion, we show that the 2-level cactus of G is planar if the cardinality 
of a minimum edge-cut of G is not equal to 2, 3 or 5. On the other 
hand, we give examples for non-planar 2-level cacti of graphs with these 
connectivities. 

1 Introduction 

Edge connectivity is a fundamental structural property of a graph. In the last 
decade, not only the properties of minimum cuts but also the number uwm 
and structure [T] of near minimum cuts were examined. Galil and Italiano [8] and 
Dinitz and Westbrook |3I7J developed models for all 1 and 2 cuts and all 2 and 3 
cuts, respectively. Based on these two models, Dinitz and Nutov introduced the 
so called 2-level cactus model - a data structure that represents the minimum and 
minimum-1- 1 edge cuts of an undirected multi-graph with connectivity A > 3 in 
a compact way [^. There is no other so compact model, and no other compact 
graph model for these cuts known, for the best of our knowledge. The above 
models imply, in particular, fast incremental maintenance algorithms mai]. 

The 2-level cactus model (or “2-level cactus” , for simplicity) generalizes the 
cactus model of all minimum cuts [1|. In case of odd connectivity A > 3, the 
2-level cactus is really a cactus, that is a connected graph in which every edge is 
contained in at most one simple cycle. Some cuts, however, are represented only 
implicitly in the graph of the model. In order to reduce this implicitness, we add 
some auxiliary edges. We call the resulting graph the extended 2-level cactus. 
The main question considered in this paper is whether the modeling graph is 
planar, for both odd and even cases. 

The proof of planarity is based on properties of the set of projection paths of 
auxiliary edges, that is the set of (shortest) paths in the 2-level cactus between 
the end nodes of auxiliary edges. To obtain planarity, we give a new sufficient 
planarity criterion, generalizing a corollary to the criterion of MacLane m- 
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The question of planarity is not only of graph theoretical interest, it is also 
useful for algorithmic purposes. But the main reason, why we study planarity 
of the 2-level cactus, is an application in graph drawing. We are interested in 
drawing all small cuts (“bottlenecks”) of a graph. Clearly, for a presentation on a 
screen, planarity is a key property. A first approach in visualizing the minimum 
cuts is done in |^, utilizing the cactus model for all minimum cuts [4j. The 2-level 
cactus might be a next step in this direction. 

This paper is organized as follows. Sect. |2] first introduces MacLane’s pla- 
narity criterion. Then, we generalize a corollary of MacLane’s criterion. Sect. 
introduces the 2-level cactus. In Sect. |4j we show, first, that it is planar for 
connectivity A = 1 and for any even connectivity A > 4. In case of odd con- 
nectivity, we show that the 2-level cactus together with the auxiliary edges is 
planar if A > 5. We also give examples of non-planar (extended) 2-level cacti of 
graphs with connectivity A = 2, A = 3, and A = 5. We conclude the paper by 
Sect. El with some remarks on how we would like to choose the faces of a planar 
embedding. 



2 Planarity of Trees with Additional Edges 

Let E 1 AE 2 = Ei \ E 2 LI E 2 \ El be the ring sum of two sets Ei and A2. Let (£<3 
be the vector space on the subsets of an edge-set A of a graph G over F2 under 
the ring sum operation A. The set 3 g of all cycles and unions of edge-disjoint 
cycles is a subspace of the vector space and is called the cycle space of G. A 
2-basis of G is a basis of the cycle space of G, such that every edge occurs in at 
most two elements of this basis. 

Theorem 1 (Planarity Criterion of MacLane [llj L A graph is planar if 
and only if it has a 2-basis. Moreover, any 2-basis of a 2-eonnected graph eonsists 
of all but one facial cycle of some of its planar representations. 

A short proof of MacLane’s planarity criterion can be found in m- 

A basis of the cycle space can be constructed from a spanning tree: Let T be 
a spanning tree of a connected graph G. For an edge e = {?;, w} in G — T, let pe 
denote the set of edges on the path in T between v and w, called the projection 
path of e. Then, {{e}Upe\e edge in G — T} is a basis of the cycle space 3g- Thus, 
there is the following immediate corollary of MacLane’s Planarity Criterion. 

Corollary 1. A graph G is planar if there is a spanning tree T of G sueh that 
every edge in T is contained in at most two projeetion paths. 

The following lemma gives a sufficient planarity criterion under somewhat weaker 
conditions. 

Lemma 1. A graph G is planar if there is a spanning tree T of G such that for 
any edge e = {u, w} in T, the number of projection paths that contain e and one 
more edge in T incident to v is at most two. 
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No A's 




No 

Case: v is not a node Case: ri is a node 



Fig. 1. Illustration of the proof of Lemma [T] Rectangularly shaped vertices are nodes. 



Proof. Let G = (F, BOS’)!!] be a connected graph such that T = {V, B) is a 
spanning tree of G that fulfills the condition of the lemma. We use Kuratowski’s 
Theorem m- 



G does not contain a K^: Suppose G contains a subdivision of a as a sub- 
graph. We call the five vertices of this nodes. Let T' be the smallest subtree of 
T that contains all nodes of the K^. Let be a leaf of T' . Then, r; is a node. Let 
e = {v,w} be the edge incident to v in T' . At most one of the four subdivision 
paths of the that connect v to the other nodes can contain vertex w. This 
situation is illustrated in Fig. [I] (left). Thus, there are at least three subdivision 
paths that (i) connect the two connected components Ti and T2 of T — {e} and 
that (ii) do not contain w. Such a subdivision path contains at least one edge 
s G S with one end vertex in Ti and one end vertex in T2. The projection path 
of s contains e and one more edge incident to w. Hence, there are at least three 
projection paths containing e and another edge incident to w. This contradicts 
the precondition. 

G does not contain a K3 3: Suppose G contains a subdivision of a K3 3. We call 
the six vertices of this nodes. We distinguish the two parts of the ^3,3 as 
white nodes and black nodes. Let T' be the smallest subtree of T that contains 
all nodes of the ^^3,3. Let r; be a vertex that has maximum degree in T' . 

Let us first consider the case that deg2./(z)) > 3 and v is not a node as 
illustrated in Fig.[T] (middle). At most one subdivision path can contain v. So, 
there is at least one connected component T{ of T' — {u} that contains none of 
the end nodes of such a subdivision path. Let e be the edge that connects T{ to 
V in T' . Each subdivision path incident to a node in T[ and a node that is not in 
T{ must contain an edge from S whose projection path contains e and another 
edge incident to v. If T[ contains b black nodes and w white nodes, there are 



^ With M1UM2 we denote the disjoint union of two sets Mi and M2. 
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6(3 — ui) + w{3 — b) such paths. As we have 1 < 6 + w < 4, there are at least 3 
such paths. This contradicts the precondition. 

Now consider the case that deg-j^, (v) > 3 and n is a node - say a black node 
as illustrated in Fig. [T] (right) . Let T{ be a connected component of T' — {v} that 
contains at least one black node. Let e be the edge that connects T( to v in T' . 
If T{ contains 6 black nodes and w white nodes, there are 6(3 — w) + rc(3 — 6 — 1) 
subdivision paths between nodes in T( and nodes not in T( + {u}. As we have 
1 < 6 + ic < 3, there are at least 3 such paths and thus at least 3 projection 
paths containing e and another edge incident to u - a contradiction. 

If degy/(u) = 2, tree T' is a path. Without loss of generality, we can assume 
that V is the third node in the path. Then, we can use the same argumentation 
as in the previous case. □ 



3 The 2-Level Cactus Model 

Throughout the rest of this paper, let G = (V,E) be an undirected connected 
multi-graph. Even though there might be several edges of G that are incident 
to the same two vertices v and w, we denote each of them by {u,r/;}. For two 
subsets S,T C V let E{S,T) := {{s,t} G E| s G S', t G T} denote the set 
of edges in E that are incident to a vertex in S and to a vertex in T and let 
c{S,T) := |£'(S, r)| be the cardinality of this set. A non-empty proper subset 
S of E induces the cut E{S, S) of G. A 2-cut is a cut of cardinality 2. Let 
A = min 0 cscv c(S, S) denote the minimum cardinality of a cut of G. 

A cut that is induced by S divides a subset T of E if none of the two 
sets S n T and S fl T is empty. Two cuts E{S, S) and E{T, T) are crossing, if 
none of the four corner sets S fl T, S fl T, S fl T, and S fl T is empty. If not, 
they are parallel. A cut that is induced by a corner set is called a corner cut. 
G' = {V , E') results from G by shrinking a subset S of V, if C' = (E \ S)U{us} 
and E' = {E\ E{V, S)) U {{u, us}; {v, sj G E{S, S)j, that is every incidence of 
an edge in if to a vertex in S is replaced by an incidence to vs, omitting loops. 
Two cuts G and C' induce the quotient graph that results from G by shrinking 
the four corner sets. 

For A > 3, Dinitz and Nutov developed in a compact model for the A and 
(A -I- l)-cuts of a graph G, called the 2-level cactus model. In the following, we 
briefly sketch this model and summarize those properties that we use to prove 
the planarity of the 2-level cactus. 

Generally, a model for a family E of cuts of G is a triple (Q, tp,T) such that 
the model graph Q = {Af, £) is an undirected multi-graph, is a set of cuts of Q, 
and (p : V — Af is a mapping with = E, where p~^{E{S, S)) is deflned 

to be E{p~^{S),p~^{S)). We say that a cut C of Q models the cut p~^{C). The 
elements of Af are called nodes and a node v G Af with p~^{y) = 0 is called an 
empty node. 

In [5], models for the set F of A and (A -I- l)-cuts are built in the following 
way. Set F is divided into the set of those A-cuts not crossing any other A-cut in 
F, called the set of all basic cuts the set of all remaining cuts in F that do 
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not cross any cut in ^bas^ called the set of all local cuts and the set of all 

cuts that cross at least one of the cuts in called the set of all global cuts 

F^^'°. i^bas modeled by the tree 7”*’®'®. See Fig. |2^,b for an illustration to 

the construction of such a tree. 






Fig. 2. a) A set of cuts of a graph (the graph edges are shown grey). Continuous curves 
indicate basic cuts and dashed curves indicate local cuts. The dotted curve represents 
a global cut. b) Tree and c) two local models and G^2 that d) are implanted 
instead of vi and 1/2 into 7'bas nodes are empty nodes. The remainders of 

the edges in 7't’as grey, e) These remainders can be contracted at the end of the 
implantation process, f) The opening of the white halo node in (e). 



For a node ly G Af, let , . . . , be the subsets of V that are mapped on 
the connected components of — v, and let Gi, be the quotient graph of v 
that is the graph resulting from G by shrinking the sets Vf,..., Vff into a single 
vertex each. Given any subset of the local cuts, it can be partitioned into 
V G Af, where Fl°^ is defined as the set of those cuts in that do not 
divide any of the sets Vf. Assume, for all v G Af, there is a model for the cut 
set F}f^ (they can be considered as cuts of the graph Gy). Then, a model for 
F"bas y ^loc jg built by “implanting” for every node v G N with 0 a model 

Qy - called local model - for Fl°^ into See Fig.[2fc,d,e for illustration. The 
nodes of Gy with (p~^{p,) = Vf, for some i, are called halo nodes. We also call 
the vertices of Gy that correspond to the shrunken sets Vf halo vertices. Finally, 
depending on whether A is even or odd, the global cuts and the local cuts in 
\ are added in a suitable way. 

Odd connectivity. Every global cut is modeled by a 2-cut of Moreover, for 
a 2-cut {ei, 62} in F'°‘^^, let Peiea be the set of edges on the path between ei and 
62 in 7"bas^ g^ included. If cut {ei, 62} models a global cut, then, any 

2-cut {e(, 62} C Peiea does also model a cut in (|5], Lemma 5.1). The 

cuts that are modeled by a 2-cut of are called degenerate cuts. Let P be 
the set of inclusion-maximal sets Peiez^ such that {ei, 62} models a (A -|- l)-cut 
of G. The elements of P are called generating paths. 
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Lemma 2 (|^ Lemma 5.4 and proof of Lemma 5.5). 

1. Any two generating paths have at most one edge in common. 

2. Let A > 3. Let p € P and {t'jw} £ p. Except there are at most two 

edges incident to v in such that there exists a generating path containing 
{Ujty} and such an edge. 

For the local models, we consider only A > 3. Let be the set of non- 
degenerate local cuts plus the set of corner cuts of non-degenerate local cuts. 
Let v he a node of 

Lemma 3 (j5] Lemma 5.4). Let C,C he two crossing non-degenerate cuts. 
Then, the quotient graph induced by C and C is a simple cycle with edges 
between adjacent vertices. 

From this lemma and the fact that for A > 3 no degenerate cut in crosses 
another cut in Dinitz and Nutov conclude in [5] that there exists a tree of 
cycles which is a suitable local model for each node v with ^ 0. Implanting 
these local models into T”*"*® results in a cactus tree type graph Q, which will be 
called 2-level cactus. 

To make the generating paths, and thus the 2-cuts of Q modeling the global 
cuts, visible in a drawing of the 2-level cactus G, let us extend G as follows. For 
each generating path p, consider the corresponding sequence of edges in G and 
add an auxiliary edge Cp connecting the first and last end node of this sequence 
to £. We call the result extended 2-level cactus The set of edges on the 
shortest path in G between the two end nodes of Cp is called the projection path 
of Cp. Note, that it follows from Lemma EJltem ID in Sect, m that the projection 
paths are unique up to multiple edges. 

Even connectivity. For a node v of the local model Gu is either a simple 

cycle or it can be described as a tree % plus the halo nodes, where each halo 
node is connected by two additional edges to 7I-. In the latter case, the following 
property holds. For a halo node p, let Pp be the set of edges on the path in % 
between the two end nodes of the two edges incident to p, and let P be the set 
of these paths. Lemma 5.6 in gives the following properties of the paths in 
P. 

Lemma 4. 

1. Two elements of P have at most one edge in common. 

2. An edge ofTv is contained in at most two elements of P. 

Let C be a global cut. Then, there is exactly one non-empty node z/ of 
such that C divides Moreover, C contains one or two sets of | edges 

corresponding to an edge of a cycle that was implanted instead a node p in the 
neighborhood of v in , To model these cuts, the halo node of Gi> that was 
implanted into edge {v,p} of is “opened”, which means the halo node is 
deleted and corresponding pairs of edges are merged. See Fig. [2F for illustration. 
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Suppose now that every edge in the tree 71- of a local model that is contained 
in k elements of P is replaced by 3 — fc parallel edges and that every tree edge 
of 7’*’'^® that was not contracted after implanting is replaced by a pair of parallel 
edges. Then, the 2 - and 3-cuts of the resulting 2-level cactus G model the A- and 
(A -I- l)-cuts of G, respectively. This is the 2-level cactus tree model for the even 
case. 

4 Planarity of the 2-Level Cactus 

Odd connectivity. In Sect. we have shown that a tree with additional edges, 
such that the projection paths fulfill the properties of Lemma [2] is planar. We 
cannot apply Item [2] of Lemma |2] for A = 3 and, it turns out that in this case the 
2 -level cactus is not planar in general. For example (see Fig. [HK,b)j if we take 
G = 7 ^ 4 , then the extended 2 -level cactus is K^. 

In case of A > 5, the tree extended by the above auxiliary edges is 
planar, by Lemma [21 However, when A = 5, implanting the local model might 
destroy planarity. An example is shown in Fig.|3b,d. The 2-level cactus in Fig.|31i 
contains a subdivision of a with the 5 white nodes as nodes of the . 




Fig. 3. Example of planar graphs of a) connectivity 3 and c) connectivity 5, and their 
non-planar extended 2-level cacti b) and d). is represented by the location of the 
vertices and nodes. In the extended 2-level cacti, white nodes are empty nodes, black 
edges are tree-edges, dashed edges represent the generating paths, and double edges 
are those of the implanted local model. 



To show that for A > 7, implanting the local models preserves planarity, we 
will use the following trick. First, we will consider all cycles of G and modify 
them in G~^ ■ Then, we show that Lemma [T] can be applied to the thus modified 
extended 2-level cactus G~^- Second, we will restore the original extended 2-level 
cactus G~^, and show that planarity is preserved. We start with the following 
observation (see also [ 6 | Lemma 5.2). 

Lemma 5. Let ei and be two edges of a generating path. Let Vi, V 2 and 
V be the set of vertices of G that are mapped on the connected eomponents of 
G — { 61 , 62 } such that V' induces the (A -I- l)-cut that is modeled by { 61 , 62 }. 
Then, there are exactly edges eonnecting Vi and V 2 . 
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Proof. Let e, ei and 62 be the number of edges between Vi and V2, Vi and V ^ and 
V2 and F', respectively. Then we have e + Ci = A, e + 62 = A and £i + C2 = A + 1. 
Thus e = □ 

An edge of an implanted local model is called a tree-edge if it is contained 
in a 2-cycle, and it is called a cycle-edge if it is contained in a cycle of length 
greater than 2. Applying Lemma Eland Lemma El we can show 

Lemma 6. 

1. A projection path contains at most one edge of each simple cycle. 

2. Let A > 5. Each cycle-edge is contained in at most one projection path. 

3. Let A > 7. Each tree-edge of an implanted local model is contained in at most 
two projection paths. 

Proof. 

1. Let p be a projection path that contains the edges {z)i, U2}, • . . , u/} 

of a simple cycle c = Vi, . . . ,Vk. Suppose p contains more than one edge of 
c. As p takes the shortest path on c, it follows that k > 1. Let Vi,V 2 ,Vi, 
and Vk be the subsets of V that are mapped on the connected components 
of C/ - {{ui,ti2},{ui-i,'yi},{ui,ui+i},{?;fe,'!;i}} such that a vertex of is 
mapped on Vi. Then, the two cuts that are induced by Vi U V 2 and by 
V 2 LIV 1 , respectively, are crossing (A -I- l)-cuts. By Lemma E] there are no 
edges between Vi and Vj. On the other hand, let us consider the generating 
path Cl, 62 , . . . , e^, corresponding to p. By Lemma El applied to the pair 
{ei,6r}, there are at least edges between Vi and Vj, a contradiction. 

2. Let c be the set of edges of a simple cycle in a local model Gi, and let {ui, 62} 
be an edge in c. For i = 1, 2, let M be the set of nodes in the connected 
component of ^ — c that contains Vi, and let Vj be the subset of V that is 
mapped on Afi. Suppose edge {ui, 62} is contained in at least two projection 
paths Pi and p2- Let ej,i,j = 1,2, be the edge on pi with end nodes in Afj 
that was incident to n in before implanting Gi/ . By Lemma [2] Item [T] 
we know that e\ yf 62 or ef e|, so we may assume e^. Let v.p^ and 
?;p2 be the end-nodes of p\ and p 2 in M 2 . For * = 1, 2, let G pi be the edge 
incident to Up . , and let and Vp^ be the subsets of V2 that are mapped on 
the connected components of G — {61,62} that contain Up. . See Fig. |4 }l for 
illustration. Then, by Lemma|5]there are edges between V} and Vp^ and 
between V\ and Vp^, respectively, and thus, there are at least A — 1 edges 
connecting Vi and V2- On the other hand, by Lemma c(Vi,V2) = 
contradicting the precondition that A > 5. 

3. Let 6, e' be a 2-cycle of a local model, and let e be the number of projection 

paths containing either e or e'. Let Vi and t^2 be the sets that are mapped 
on the connected components oft/ — |e, e'j. With the same argumentation 
as in Item|2] per projection path that contains 6 or e', there are at least 
distinct edges between Vi and V2. As {Vi, V2} induces a (A -I- l)-cut, e must 
fulfill the inequality e • < A -I- 1. For A > 7, it follows e < 3. □ 
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Fig. 4. a) Illustration of the proof of Lemma |6] Item 2. b), c) Modification of the cycles 
in the local model b) in case not all and c) in case all cycle edges are contained in a 
projection path, that is indicated by a dashed curve, d) Illustration of the final proof 
of planarity in the odd case. 



Let us consider the following modifications on the cycles. Let c be a simple 
cycle. If c contains edges that are not contained in a projection path, we choose 
one of these edges and declare it to be an auxiliary edge. If each edge of c is 
contained in a projection path, we replace c by a star. That means, we delete all 
edges of c, add an additional vertex Vc to and connect all vertices of c to Vc- 
See Fig. |H3,c for illustration. We call the so modified graph Q' and we denote 
by -S" 3 S' the set of all auxiliary edges of G' and by T' the spanning tree of Q' 
that is induced by the edges that are not in S'. 

The only case that e is contained in the projection path of an edge in S' \ S is 
if e is a cycle-edge of a local model. Thus, by the above lemma and by Lemma |2] 
G' with its spanning tree T' fulfills the condition of Lemma[l] Hence, G' is planar. 

It remains to show that we can restore the deleted cycle-edges into G' without 
producing edge crossings. Let c be the set of edges of a cycle of G that we 
have replaced by a star. Let be an edge of c. We will show, that in any 

embedding of G', edges {v,Vc} and {w,Vc} are neighboring in the cyclic order 
around Vc- Suppose not: then v and w divide the adjacency list of Vc into two 
non-empty parts S and T. As the graph induced by c — {v,w} is connected, 
there is an edge {s,t} in c with s G S and t gT. Let p be the projection path 
that contains edge {u,w}. Let Cp S S' be the auxiliary edge that represents p 
and let p' be the projection path of Cp in G' ■ Similarly, let q be the projection 
path that contains edge let e, be the corresponding edge in S and q' the 

corresponding path in G' ■ Then, on one hand, the two cycles induced by the edge 
sets Cp = p' U 6p and Cq = q' U Cg are vertex disjoint except Vc- If not, the graph 
that is induced by Cp U Cq has at least four faces and thus, the graph induced by 
p' U q' , which is a sub graph of tree T', would contain a cycle. But on the other 
hand, there is an edge of Cq in the inner part of Cp and an edge of Cq in the outer 
part of Cp, contradicting the planarity of G'- See Fig. 3}i for illustration. 
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Even connectivity. The corollary to MacLane’s planarity criterion shows that a 
tree with additional edges, such that the projection paths fulfill the properties of 
Lemma|4]is planar. Thus, the local models are planar. Because the local models 
are stuck together in a tree-structure, proceeding from a leaf, we can open the 
halo nodes without losing planarity. 

Connectivities One and Two. Finally, for completeness, we consider the pro- 
totypes of the 2-level cactus. The cut model of the inclusion minimal 1- and 
2-cuts was introduced in |8]; it is a tree of 
edges and cycles and thus, is planar. In the 
case A = 2, a cut model for all minimum 
and minimum-l-1 cuts is described in 13 ; it 
is constructed in a similar way as the 2-level 
cactus in the even case. In general, it is not 
planar. An example is a AT 4 with every edge broken into two by a new vertex. 
This graph and its non-planar 2-level cactus is shown on the right. We summarize 
this section in the following theorem. 

Theorem 2. For a connected multi-graph with edge-connectivity X = 1 or with 
even edge-connectivity A > 4 the 2-level cactus is planar, for odd edge-connectivity 
A > 7 the extended 2-level cactus is planar, and for all other connectivities there 
are examples of non-planar (extended) 2-level cacti. 

5 Final Remarks 

For an extended planar 2-level cactus (the odd case), it would be nice if: 

(a) the auxiliary edges are all on the same (outer) face and 

(b) each cycle that is generated by an auxiliary edge and its projection path is 
the boundary of a face. 





Fig. 5. a) A graph of odd connectivity A = 7 and its 2-level cactus G. There is no 
embedding of G such that all dashed auxiliary edges are on the same face, b) A graph 
of even connectivity A = 4 and its 2-level cactus G. All non- grey nodes belong to a 
single local model Gv. There is no embedding of Gv such that its halo nodes (the white 
nodes) are on the same face. With an increasing number of vertices, both examples can 
be extended to arbitrary odd or even connectivity, respectively. 
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In this case, Q is completely contained inside a planar drawing of , and the 
projection path that corresponds to an auxiliary edge can be discovered more 
easily. Moreover, the interior of each simple cycle of a local model will then be 
empty. In the even case we wish that: 

(a) the halo nodes of any local model are all on the same (outer) face of 0,^. 

In this case, the tree structure of can be better recognized in a planar 

drawing of G- 

Unfortunately, there are examples (see Fig. 0 and Fig. that show that 
the above mentioned properties of the (extended) planar 2-level cactus are not 
true in general. However, it follows from Mac Lane’s planarity criterion that in 
the even case. Property (a) is true for the 2-connected components of the local 
models. In the odd case, by subdividing each edge of that is contained in 
more than two projection paths as illustrated in Fig. Eb, the extended 2-level 
cactus G'^ can be modified in such a way that the result is still a linear sized 
model for the minimum and minimum -|- 1 cuts of G, and has an embedding 
such that 

(c) for each auxiliary edge e, the cycle on the edge set e U Pe is a part of the 
boundary of some face /. 




Fig. 6. a) A graph of odd connectivity A > 3 and its 2-level cactus G- Vertical edges 
refer to edges, diagonal edges refer to edges, and horizontal edges refer to one 
edge each. There is no embedding of G with the property that, for every simple cycle c 
that is generated by a dashed edge and its projection path, there is a face / such that 
c is part of the boundary of /. b) Splitting the edge that is contained in more than 2 
generating paths achieves this property. 
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Abstract. In this paper we show that a very simple and efficient ap- 
proach can be used to solve the all pairs almost shortest path problem on 
the class of weakly chordal graphs and its different subclasses. Moreover, 
this approach works well also on graphs with small size of largest induced 
cycle and gives a unihed way to solve the all pairs almost shortest path 
and all pairs shortest path problems on different graph classes includ- 
ing chordal, strongly chordal, chordal bipartite, and distance-hereditary 
graphs. 



1 Introduction 

Let G = (V,E) be a finite, unweighted, undirected, connected and simple (i.e., 
without loops and multiple edges) graph. Let also \V\ = n and \E\ = m. The 
distance d(v, u) between vertices v and u is the smallest number of edges in 
a path connecting v and u. The all pairs shortest path problem is to compute 
d(v, u) for all pairs of vertices v and u in G. 

The all pairs shortest path (APSP) problem is one of the most fundamental 
graph problems. There has been a renewed interest in it recently (see | 1I2I4I6I1^ . 
|15I29| for general (unweighted, undirected) graphs, and |3I5I7I10I11I14I20I21I24| . 
I28I32I33I for special graph classes). For general graphs, the best result known is 
by Seidel |^, who showed that the APSP problem can be solved in 0{M{n) 
log n) time where M (n) denotes the time complexity for matrix multiplication 
involving small integers only. The current best matrix multiplication algorithm 
is due to Coppersmith and Winograd m and has 0(n^-^^®) time bound. 
In contrast, the naive algorithm for APSP performs breadth-first searches from 
each vertex, and requires time 0{nm) which is 0{n^) for the dense graphs. 

Given the fundamental nature of the APSP problem, it is important to con- 
sider the desirability of implementing the algorithms in practice. Unfortunately, 
fast matrix multiplication algorithms are far from being practical and suffer from 
large hidden constants in the running time bound. There is interest therefore in 
obtaining fast algorithms for the APSP problem that do not use fast matrix 
multiplication. The currently best combinatorial (i.e., in graph-theoretic terms) 
algorithm for the APSP problem is an 0(n^/ log n) time algorithm obtained 
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by Basch et AL [^. This offers only a marginal improvement over the naive 
algorithm. 

Since an algorithm for the APSP problem would yield an algorithm with 
similar time bound for Boolean matrix multiplication, obtaining a combinatorial 
time algorithm for the APSP problem would be a major breakthrough. 
Nowadays, many researchers take approach to consider the all pairs almost short- 
est path (APASP) problem in general graphs or to design (optimal) 0{v?) time 
algorithm for special well structured graph classes (which are interesting from 
practical point of view) . 

Awerbuch et al [4| and Cohen m considered the problem of finding 
stretch t all pairs paths, where t is some fixed constant and a path is of stretch t if 
its length is at most t times the distance between the endvertices. They presented 
efficient algorithms for computing t-stretch paths for t > 4. A different approach 
was employed by Aingworth et al [T]. They described a simple and elegant 
0{n^^‘^y/logn) time algorithm for finding all distances in unweighted, undirected 
graphs with an additive one-sided error of at most 2. They also make the very 
important observation that the small distances, and not the long distances, are 
the hardest to approximate. Recently, Dor et al [Tg improved the result of 
Aingworth et al PQ and obtained an 0(min{n^/^m^/^, n^/^}) time algorithm 
for finding all distances in unweighted, undirected graphs with an additive one- 
sided error of at most 2 (here, 0(/) means 0(/polylog n)). A simple argument 
shows that computing all distances in G with an additive one-sided error of at 
most 1 is as hard as Boolean matrix multiplication (see Hg). 

Besides, efficient algorithms have been developed for solving the APSP prob- 
lem on some restricted classes of graphs. Optimal 0{ii?) time algorithms were 
proposed for interval graphs I8I24I28I33I . circular arc graphs H33|, permutation 
graphs [M], bipartite permutation graphs [TO], strongly chordal graphs |5I14I20| . 
chordal bipartite graphs [21], distance-hereditary graphs [Mj, and dually chordal 
graphs |7]. The optimal parallel algorithms for the APSP problem for some cer- 
tain classes of graphs can be found in I10I14I20I32I . 

The approach to the shortest path and other optimization problems in met- 
ric spaces (e.g. in a simple rectilinear polygon endowed with some metric) taken 
by many researchers first finds a visibility graph of the metric space which con- 
tains a shortest path between any two given points. Motwani et al firnE] 
studied two classes of rectilinear polygons and showed that the visibility graphs 
corresponding to them are chordal or weakly chordal, respectively. More recently, 
Everett and Corneil m have shown that the visibility graphs of spiral poly- 
gons are interval graphs (the definition of the various families of graphs will be 
presented in the next section). 

This motivated researchers to look at distances in chordal graphs |8l11l20l32j . 
Han et al showed in [20] that the APSP problem for a chordal graph G can be 
solved in 0{n^) time if the square G^ of G is already given. They also note that 
computing G^ for chordal graphs is as hard as for general graphs, and present 
few subclasses of chordal graphs for which G^ can be computed more efficiently. 
From results in [Sj it follows that within 0{n^) time bound one can compute all 
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distances in chordal graphs with an additive error of at most 2. Even more, given 
a pair of vertices x,y G V, after a simple linear time preprocessing, in only 0(1) 
time one can compute d{x, y) with an additive error of at most 2. Sridhar et 
AL showed in that using a more sophisticated linear time preprocessing of a 
chordal graph, the distance between any two vertices which is at most 1 greater 
than the shortest distance can be computed in 0(1) time. 

In this paper we show that a very simple and efficient approach can be used 
to solve the APASP problem on the class of weakly chordal graphs and its 
different subclasses. Furthermore, we show that this approach works well also on 
graphs with small size of largest induced cycle and gives a unified way to solve 
the APASP and APSP problems on different graph classes including chordal, 
strongly chordal, chordal bipartite, and distance-hereditary graphs. 

The rest of this paper is organized as follows. We begin in Section 2 by 
presenting some definitions and a simple 0{n?) time algorithm for computing 
all pairs almost shortest path distances in graphs with special vertex orderings. 
In Section 3, we prove that this algorithm 

— computes all distances 

• in House-Hole-free graphs with an additive one-sided error of at most 2, 

• in House-Hole-Domino-free graphs (and hence in chordal graphs) with 
an additive one-sided error of at most 1, and 

— solves the APSP problem for distance-hereditary graphs, chordal bipartite 
graphs, strongly chordal (and hence interval) graphs. 

In this section we also show that the APSP problem for a House-Hole-free graph 
G can be solved in 0{-n?) time, if the square of G is given. All these classes are 

subclasses of the weakly chordal graphs. Finally, in section 4 we consider graphs 
G with small size of largest induced cycle and show that our generic algorithm 
computes in O(n^) time all distances in G with an additive one-sided error of at 
most k — 1, where k is the size of largest induced cycle in G. Hence, for weakly 
chordal graphs (even for Hole-free graphs), all distances can be computed in 
0{v?) time with an additive one-sided error of at most 3. 

2 Preliminaries and the Algorithm 

The (open) neighborhood of a vertex v is the set N{v) = {u G V : uv G E} and 
the elosed neighborhood is fV[u] = N{v) U {u}. A path is a sequence of vertices 
(t>o, . . . , vi) such that ViVi+i G E for i = 0, . . . , ^ — 1; its length is 1. We say that 
this path connects vertices vq and vi. An induced path is a path where ViVj G E ii 
and only if j = j — 1 and j = 1, ... ,1. A cycle is a sequence of vertices (vq, . . . , Vk) 
such that uo = Vk and ViVi+i G if for i = 0, . . . , A: — 1; its length is k. An induced 
cycle is a cycle where ViVj G if if and only if \i — j| = 1 (modulo k). A hole is 
an induced cycle of length at least 5. 

We now define the various graph families that will be used in this paper. 
A graph G is chordal if there is no induced cycle of length greater than 3 in 
G. A Hole-free graph is a graph which contains no holes. A graph G is weakly 
chordal if both G and the complement of G are Hole-free graphs. A bipartite 
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Fig. 1. 



weakly chordal graph is called chordal bipartite. An interval graph is the in- 
tersection graph of intervals of a line. A graph is called AT -free if it does not 
have an asteroidal triple, i.e. a set of three vertices such that there is a path be- 
tween any pair of them avoiding the closed neighborhood of the third. A graph 
is House-Hole-free (HH-free) if it contains no induced houses or holes and is 
House-Hole-Domino-free (HHD-free) if it contains no induced houses, holes or 
dominos (see Figure[T]). A distance-hereditary graph is a HHD-free graph which 
does not contain 3-fan as an induced subgraph. Finally, a graph G = (F, E) is 
strongly chordal if it admits a strong elimination ordering, namely, an ordering 
vi,V 2 , ■ ■ ■ ,Vn of V such that for each i, j, k and if z < j, k < I, Vk, vi S 
and Vk G then vi G N[vj]. For more information on these graph classes 

we refer the reader to | 9I19| 

For computing approximate distances in these graph classes we use the fol- 
lowing simple O(n^) time algorithm. Let G = (V,E) be a graph and a = [vi,V 2 , 

. . . , Vn\ be an ordering of the vertex set V of G. Let also a{v) be the number 
assigned to a vertex v in this ordering. Denote by mn(y) the maximum neighbor 
of V, i.e., the vertex of with maximum number in a. 

Algorithm APASP 

for j = 1 to n do d{vi, Vi) ~ 0; 

for i = n — 1 downto 1 do 
for j = n downto i -\-l do 

if ViVj € E then d{vi,Vj) := 1 else d{vi,Vj) ~ d{mn{vi),Vj) + 1; 

return distances d. 

We assume that ordering a used in the algorithm has the property that for 
all Vi with i < n, mn(vi) ^ Vi (otherwise algorithm will not work properly). This 
property holds for all orderings considered in this paper. 

Note that this method is not new. It was already used to compute all pairs 
shortest path distances in strongly chordal graphs (see [Hj ) and in dually chordal 
graphs (see 0) . For both classes the characteristic orderings were employed. Here 
we show that this method gives the exact distances or a very good approximation 
of distances in many other graph classes. 

Clearly, the algorithm can be modified to produce paths of length d rather 
than merely returning the approximate distances. Nevertheless, we present a 
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separate procedure which for any two vertices u, v returns the path of length 
d{u, v) along which the APASP algorithm computes. Its complexity is 0{c ■ 
d{u,v)), where c is the necessary time to verify the adjacency of two vertices). 

Procedure (mn— path(w, v)) 

if u and v are adjacent then return (u, v) 
else if cr(u) < cr(v) then return (u,mn-path(mn(u), v)) 
else return (mn-path(u, mn(n)), n) 



The path between u and v returned by the procedure mn-path(u, v) will be 
called the maximum-neighbor path. Throughout this paper, by d{u, v) we denote 
the (real) distance between u and v and by d(ti, v) the approximate distance 
returned by the algorithm. 

In this paper we will employ three types of vertex orderings: breadth-first- 
search orderings, lexicographic-breadth-first-search orderings of Rose et AL |3T| 
and lexical orderings of Lubiw |23| . 

Let a = [vi,V 2 , ■ ■ ■ , u„] be any ordering of the vertex set of a graph G. We will 
write a < b whenever in a given ordering a vertex a has a smaller number than 
vertex b. Moreover, {ai,---,a;} < {bi, ■ ■ ■ ,bk} is an abbreviation for Oi < bj 
{i = 1, • • • , /; j = 1, • • • , fc). Let m be a vertex of G. We define the layers of G 
with respect to vertex u as follows: Lfiu) = {u : d{u, v) = i} for i = 0, 1, 2, . . .. 

In a breadth-first search (BFS), started at a vertex u, the vertices of a graph 
G with n vertices are numbered from n to 1 in decreasing order. The vertex u is 
numbered by n and put on an initially empty queue of vertices. Then a vertex 
V at the head of the queue is repeatedly removed, and neighbors of v that are 
still unnumbered are consequently numbered and placed onto the queue. Clearly, 
BFS operates by proceeding vertices in layers: the vertices closest to the start 
vertex are numbered first, and the most distant vertices are numbered last. BFS 
may be seen to generate a rooted tree T with vertex u as the root. A vertex v is 
the father in T of exactly those neighbors in G which are inserted into the queue 
when V is removed. 

An ordering a of the vertex set of a graph G generated by a BFS will be called 
a BFS-ordering of G. Denote by f(y) the farther of a vertex v with respect to a. 
The following properties of a BFS-ordering will be used in what follows. Since all 
layers of V considered here are with respect to u, we will frequently use notation 
Li instead of Lfiu). 



(PI) 

(P2) 

(P3) 

(P4) 



If a; G Li, y G Lj and i < j, then x > y in u. 

li V € Lq {q > 0) then f{v) G and f{v) is the vertex from 

N{v) fj Lq-i with the largest number in a, i.e., f{v) = mn{v). 

If X > y, then either mn{x) > mn{y) or mn{x) = mn{y). 

If x,y,z G Lj, X > y > z and mn{x)z G E, then mn{x) = 
mn{y) = mn{z) (in particular, mn{x)y G E). 



Lexicographic breadth-first search (LexBFS), started at a vertex u, orders 
the vertices of a graph by assigning numbers from n to 1 in the following way. 
The vertex u gets the number n. Then each next available number k is as- 
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signed to a vertex v (as yet unnumbered) which has lexically largest vector 
(s„, s„_i, . . . , Sfe+i), where = 1 if u is adjacent to the vertex numbered i, and 
Si = 0 otherwise. An ordering of the vertex set of a graph generated by LexBFS 
we will call a LexBFS-ordering. It is well-known that any LexBFS-ordering has 
property (P5) (cf.|^). Moreover, any ordering fulfilling (P5) can be generated 
by LexBFS [TT]. 

If a < 6 < c and ac G E and be ^ E then there exists a vertex d 
' such that c < d, db G E and da ^ E. 

Lubiw in |23| has shown that any graph admits a lexical ordering, namely 
an ordering of V such that 

('PR'i If a < 6 and ac G E and be ^ E then there exists a vertex d such 
that c < d, d G N[b] and da ^ E {d = b is allowed). 

Clearly, any lexical ordering is a LexBFS-ordering, and any LexBFS-ordering 
is a BFS-ordering (but not conversely). Note also that for a given graph G, both a 
BFS-ordering and a LexBFS-ordering can be generated in linear time |19| . while 
to date the fastest method - doubly lexical ordering of the (closed) neighborhood 
matrix of G m - producing a lexical ordering of G takes 0(m log n) [22] or 
0(n'^) |2S] time. 

3 Distances in Graphs 

from the Weakly Chordal Hierarchy 

In this section, we consider different subclasses of weakly chordal graphs. The 
weakly chordal graphs itself will be considered in the next section. First we 
present the following simple but very useful lemma which holds for arbitrary 
graphs. 

Lemma 1. Let there exist integers t > 2 and s > 0 such that 

(1) for all x,y gV with x < y and d{x, y) > t, d{x, y) = d{mn{x),y) + 1 holds, 

(2) for all x,y gV with d{x, y) < t, d{x, y) < d{x, y) + s holds. 

Then d{x, y) < d{x, y) + s holds for all x,y G V. 

Proof. The proof is by induction on d{x,y). Assume that d{x,y) < d{x,y) + s 
holds for all x,y G V with d{x,y) < k {k > t), and consider a pair x,y such 
that d{x, y) = k and x < y. According to APASP algorithm we have d{x, y) = 
d{mn{x),y) + 1. Since d{mn{x),y) = d{x,y) — 1, by induction, d{mn{x),y) < 
d{mn{x),y) + s holds. Therefore, d{x, y) < d{mn{x),y) + s + 1 = d{x, y) + s. □ 

Let P = (xg, Xi, - ■ ■ , Xk-i,Xk) be an arbitrary path of G and a be an ordering 
of the vertex set of this graph. The path P is monotonic (with respect to a) if 
xq < xi < ■ ■ ■ < Xk-i < Xk holds whenever xg < Xk, and P is convex if there 
is an index i {1 < i < k) such that a;o < < • • • < Xi-\ < Xt > Xi+i > ■ ■ ■ > 

Xk-i > Xk. Vertex Xi is called the switching point of the convex path P. Let now 
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P = {xq, ■ ■ ■ , Xk) be a shortest path of G connecting xq and Xk- We say that P is 
a rightmost shortest path if the sum a(xo) + cr(xi) + • • • + cr(xk) is largest among 
all shortest paths connecting xq and Xk- 

3.1 Employing LexBFS Orderings 

Let G be a HH-free graph and cr be a LexBFS-ordering of G. By P 4 we denote 
a path on four vertices. 

Lemma 2 (|16]L G has no induced P 4 = (c,a,b,d) with {a, 6} < {c,d} in a. 

Corollary 1. Let a,b,c be three distinct vertices of G such that a < {b,c}, 
ab, ac G E and be E. Then there is a vertex d > {6, c} adjacent to b and c but 
not to a. 

Lemma 3 ( |16j ). 

(1) Every rightmost shortest path of G is either monotonic or convex. 

(2) Let P = (xo, ■ ■ ■ ,Xk) be a rightmost shortest path in G which is convex and 
Xi be the switching point of P. Then d{xo,Xi) > d{xk,Xi) if Xq < Xk- 

Lemma 4. Let x,y be a pair of vertices of G such that x < y and d{x,y) > 3. 
Then d{x, y) = d{mn{x),y) + 1 holds. 

Proof. Let P = (x,xi,X 2 , - ■ ■ ,y) be a rightmost shortest path in G connecting 
vertices x and y. Since x < y and d{x, y) > 3, by Lemma|3 we have x < X\ < X 2 . 
Assume Xi yf mn{x). Vertices v = mn{x) and X 2 cannot be adjacent, since 
otherwise we can replace x\ in P with v and get a shortest path between x and 
y with larger sum, contradicting the choice of P. For path {v,x,x\,X 2 ) we have 
{x,xi} < {v,X 2 }. Hence, by Lemma [2, v and x\ are adjacent. Now we apply 
Corollary [2 to xi < {v,X 2 }, vx 2 ^ E and get a vertex u such that u > {v,X 2 }, 
uv,ux 2 G E and X\U ^ E. Furthermore, x cannot be adjacent to u because 
V = mn{x) and u > v. But then a house formed by x, v, xi,u, X 2 is induced, that 
is impossible for a HH-free graph G. □ 

Unfortunately, this result cannot be extended to weakly chordal graphs. For 
any even integer k > 4, there exist a weakly chordal graph G = {V,E), a 
LexBFS-ordering a of G, and a pair of vertices x,y G V such that x < y in a, 
d{x,y) = k but d{x,y) < d{mn{x),y) + 1. Figure |2 gives such a weakly chordal 
graph for fc = 4 (this example can easily be extended to a larger k) . 

Lemma 5. Let x,y be a pair of vertices of G such that x < y and d{x,y) = 2. 
Then 

(1) d{x,y) = d{x,y) if mn{x)y G E or there exists a vertex z G N(x)(^N(y) 
such that z < y; 

(2) d{x,y) > d{x,y) — 1 if there exists a vertex z G N(x) (^N(y) such that 
mn{x)z G E. 
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Fig. 2. A weakly chordal 
graph with a LexBFS- 
ordering; ^ 2,5 = 4. 
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Fig. 3. A LexBFS- 
ordering of a 3-sun such 
that di, 2 — di ,2 = 1. 



2 1 




Fig. 4. A LexBFS- 
ordering of a domino 
such that di, 2 — di ,2 = 2. 



Proof. (1) Clearly, if mn{x)y S E, then d{mn{x),y) = 1 and d{x, y)= d{mn{x),y) 
-1-1 = 2 = d{x,y). So, assume that mn{x)y ^ E and there is a vertex z € N{x) 
f^N{y) such that z < y. For path {mn{x),x, z,y) we have {x,z} < {mn(x),y}. 
Hence, by Lemma [2] mn(x) and z must be adjacent. Now we can proceed as 
in the proof of Lemma and construct an induced house in HH-free graph G, 
which is impossible. 

(2) We may assume that mn{x)y ^ E and for the vertex z G N{x) f]N{y) 
with mn{x)z G E, z > y holds. We have d{mn{x),y) = 2 and y < z < mn{x). 
Hence, by (1), d{y,mn{x)) = d(y,mn{x)), and therefore d{x,y) = d{mn{x),y) + 
1 = d{mn{x),y) -|- 1 = 3 = d{x, y) -I- 1. □ 

Lemma 6. Let x,y be a pair of vertiees such that x < y, d{x, y) = 2, mn{x)y ^ 
E, and there exists a vertex z G N{x) p| N{y) with mn{x)z ^ E and z > y. Then 
G contains a domino as an induced subgraph and d{x,y) = d{x,y) — 2 holds. 

Proof. Let v = mn{x), u = mn{z) and w = mnfy) be the maximum neighbors 
in cr of a;, z and y, respectively. We have x < y < z < v, vz, vy ^ E. By property 
(P3), we have also u > w > v and hence xu,xw ^ E. Since {a;,z} < {u, it}, by 
Lemma [2 vertices v and u must be adjacent. 11 uy G E then we get an induced 
house in G, which is impossible. Therefore, d(t/, it) = 2 and we can apply Lemma 
|5] to ?/ < z < It and get d{y,u) = d{y,u) = 2. That is, vertex w = mn{y) 
is adjacent to u. If now wz G E or wv G E then vertices a;, y, z, i>, it, w induce 
either a house or a cycle of length 5. Since these induced subgraphs are forbidden, 
w is adjacent neither to v nor to z, i.e. vertices x,y, z,v,u,w form an induced 
domino. 

Now consider maximum neighbors h = mn(v), s = mn{u) and t = mn{w) of 
V, It and ic, respectively. We had d{v, w) = 2, v < w < u and it G N{v) N{w). 
If also hu,hw ^ E then, as we have shown above, vertices v,u,w, h, s,t will 
form an induced domino with edges vu,vh,us,uw,wt,hs, st. Since v = mn{x), 
It = mn{z), w = mn{y) and v<w<u<h<t<s, there cannot be 
edges with one end in {a;, z, y} and the other end in {h, s, t}. That is, the cycle 
(x, z, y, w, t, s, h, V, x) of G of length 8 is induced. Since such cycles are forbidden 
in G, we must have hu G E or hw G E. 
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11 hu £ E then vertices x, z, t;, u, h induce a house, which is impossible (note 
that h is adjacent neither to x nor to z because h > u = mn{z) > v = mn{x)). 

So, it remains only to consider the case when hw S E. In this case, according 
to the APASP algorithm, we have d{h, w) = 1 and therefore d{x, y) = d{v, y) + 
1 = d{v, w) + 1 + 1 = d{h, w) + 1 + 1 + 1 = 4. Finally, d{x, y) = 2 = 4 — 2 = 
d{x,y)-2. □ 

Combining LemmasHllHEl andElwe obtain the following results for HH-free, 
HHD-free and chordal graphs. 

Theorem 1. Let G be a HH-free graph and a be a LexBFS-ordering ofG. Then, 
for all vertices v,u € V, the distances returned in d by the algorithm APASP 
satisfy the inequality 

0 < d{v, u) — d{v, u) < 2. 

Corollary 2. All distances in HH-free graphs with an additive one-sided error 
of at most 2 can be found in 0{n^) time. 



Theorem 2. Let G be a HHD-free graph and a be a LexBFS-ordering of G. 
Then, for all vertices v,u € V, the distances returned in d by the algorithm 
APASP satisfy the inequality 

0 < d{v,u) — d(v,u) < 1. 

Corollary 3. All distances in HHD-free graphs with an additive one-sided error 
of at most 1 can be found in O(n^) time. 

Corollary 4 (|32|). All distances in chordal graphs with an additive one-sided 
error of at most 1 can be found in 0{n^) time. 

Note that the bounds given in Theorems |T] and |2] are tight. Figure 0 shows 
a chordal graph with a LexBFS-ordering where the difference between d and d 
can achieve 1, Figure Ogives a similar example for HH-free graphs. In all figures 
dark edges indicate the maximum-neighbor path between vertices numbered 1 
and 2. 

For the distance-hereditary graphs this method gives the exact distances. 

Theorem 3. Let G be a distance-hereditary graph and a be a LexBFS-ordering 
of G. Then the distances returned in d by the algorithm APASP are the real 
distances, i.e., d{v,u) = d{v,u) for any v,u€V. 

Proof. According to Lemmas [H IH O, and El we have to consider only the case 
d{v, u) = 2, V < u, mn{v)u ^ E and there exists a vertex z £ N{v) fj iV(u) such 
that mn{v)z G E, z > u. 

Consider the maximum neighbor w = mn{u) of u. Since v < u and mn(v)u ^ 
E, by property (P3), we have mn{v) < w, i.e., v < u < z < mn{v) < w, 
and wv ^ E. By Lemma [21 vertex w has to be adjacent to mn(y) or z. If 
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mn{v)w G E then we get an induced house or an induced 3-fan depending on 
adjacency of w and z. Distance-hereditary graphs cannot contain such induced 
subgraphs. Hence, w is not adjacent to mn{v) but is adjacent to z. Now we 
apply Corollary [T] to z < {mn{v), ic}, mn{v)w ^ E and get a vertex t such that 
t > w, mn{v)t,tw G E and zt ^ E. Since t > w > mn{v), vertex v is adjacent 
neither to t nor to w. Thus, we have created an induced house in G formed by 
V, z, mn{v),t, w, contradicting the fact G is distance-hereditary. □ 

Corollary 5 im)- The all pairs shortest path problem in distance-hereditary 
graphs can be solved in 0{n^) time. 

LemmalHallows us also to state the following interesting result. All what one 
needs to compute the exact distances in a HH-free graph G in 0{n^) time is to 
know in advance the square G^ of G. Recall that the square of G = (V,E) 
is the graph with the same vertex set V where two vertices u and v (u ^ v) are 
adjacent if their distance in G is at most 2. Unfortunately, computing G^ even 
for split graphs (a subclass of chordal graphs) is as hard as for general graphs. 

Theorem 4. Let G be a HH-free graph and a be a LexBFS-ordering of G. Let 
also the square G^ of G is given. Then the all pairs shortest path problem on G 
can be solved in 0{n^) time. 

Proof. Let cr = [m, U 2 , ■ • ■ , 'Cn] be a LexBFS-ordering of G. Clearly (see Lemma 
0, the following modification of APASP algorithm computes the exact distances 
in G. 

for i = 1 to n do d{vi, vf) ;= 0; 
for i = n — 1 downto 1 do 
for j = n downto i -|- 1 do 

if ViVj £ E{G) then d{vi,Vj) := 1 
else if ViVj € E[CP) \ E(G) then d(vi,Vj) := 2 
else d{vi,Vj) := d{mn{vi),Vj) -\- 1; 
return distances d. 

If the adjacency matrix of G^ is given, this algorithm clearly runs in 0{n^) time. 

□ 

Corollary 6. The all pairs shortest path problem on a HHD-free graph G can 
be solved in O(n^) time if G^ is known. 

Corollary 7 ( |20| ). The all pairs shortest path problem on a chordal graph G 
can be solved in 0{n^) time if G^ is known. 

3.2 Employing Lexical Orderings 

In this subsection we consider strongly chordal graphs and chordal bipartite 
graphs. 

Theorem 5 (). Let G be a strongly chordal graph and a be a lexical ordering 
of G. Then the distances returned in d by the algorithm APASP are the real 
distances, i.e., d{v,u) = d{v,u) for any v,u€V. 
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Proof. Since any lexical ordering of a graph is a LexBFS-ordering and strongly 
chordal graphs are HH-free (even chordal) graphs, Lemmas [4| and [5| can be 
applied. According to those lemmas and Lemma [T] we have to consider only 
the case when v < u, d{v,u) = 2 and mn{v)u ^ E. We claim that this case is 
impossible. 

It is well-known that any lexical ordering of a strongly chordal graph 
is a strong elimination ordering. Consider a vertex z G N{v)f^N{u). We have 
zv, zu,mn{v)v G E, z < mn{v),v < u. Since mn{v)u ^ A, a contradiction with 
cr is a strong elimination ordering arises. □ 



Corollary 8 ( (14|5j ). The all pairs shortest path problem in strongly chordal 
graphs can be solved in 0{n^) time. 

Since interval graphs are strongly chordal, we immediately conclude. 

Corollary 9 ( |3|24U28l33] L The all pairs shortest path problem in interval 
graphs can be solved in 0{n^) time. 

In a similar way we can prove the following (proof is omitted) . 

Theorem 6. Let G be a chordal bipartite graph and a be a lexical ordering ofG. 
Then the distances returned in d by the algorithm APASP are the real distances, 
i.e., d{v,u) = d{v,u) for any v,uGV. 

Corollary 10 (|2Tj). The all pairs shortest path problem in chordal bipartite 
graphs can be solved in 0{n^) time. 

4 Distances in Graphs with Small Size 

of Largest Induced Cycle: Employing BFS— Orderings 

In order to capture the notion of “small” induced cycles, we define a graph to 
be k-chordal if it has no induced cycles of size greater than k. Note that chordal 
graphs are precisely the 3-chordal graphs, weakly chordal graphs are 4-chordal 
graphs and AT-free graphs are 5-chordal. 

For /c-chordal graphs we obtained the following results, which we present 
here without proofs. It is interesting to note that even simple BFS-orderings 
succeed in getting a good approximation for distances in A:-chordal graphs (for 
small fc > 4). 

Theorem 7. Let G be a k-chordal graph and a be a BFS-ordering of G. Then, 
for all vertices v,u G V, the distances returned in d by the algorithm APASP 
satisfy the inequality 

0 < d{v, u) — d{v, u) < k — 1. 

Corollary 11. All distances in k-chordal graphs with an additive one-sided er- 
ror of at most k — 1 can be found in 0{n^) time. 



114 Feodor F. Dragan 



Corollary 12. All distances in hole-free graphs with an additive one-sided error 
of at most 3 can be found in O(n^) time. 

Corollary 13. All distances in weakly chordal graphs with an additive one-sided 
error of at most 3 ean be found in O(n^) time. 

This result can be strengthened further for AT-free graphs. We can prove the 
following. 

Theorem 8. Let G be an AT-free graph and a be a BFS-ordering of G. Then, 
for all vertices v,u G V , the distances returned in d by the algorithm APASP 
satisfy the inequality 

0 < d{v, u) — d{v, u) < 2. 

Corollary 14. All distances in AT-free graphs with an additive one-sided error 
of at most 2 can be found in O(n^) time. 

In Figures El El and [3 we present three graphs with BFS-orderings which 
show that the bound stated in Theorem 0 is tight at least for 4-, 5-, and 6- 
chordal graphs. Since all orderings shown in figures are even LexBFS-orderings, 
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Fig. 5. A weakly chordal graph 
with a LexBFS-ordering; di ,3 — 
di,3 = 3. 



Fig. 6. A 5-chordal graph 
with a LexBFS-ordering; 
di.3 — di,3 = 4. 
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Fig. 7. A 6-chordal graph 
with a LexBFS-ordering; 
di,3 — di,3 = 5. 
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considering LexBFS instead of BPS in Theorem |3 will not give any better bound 
(for fc > 4). Furthermore, the domino with a LexBFS-ordering presented in 
Figure m shows that the bound in Theorem [8] for AT-free graphs is tight, too. 

Note that distances in /c-chordal graphs were already considered in El . It was 
shown that for any fc-chordal graph G = (V, E) there exists a tree T = {V, F) 
such that \dG{u,v) — dT{u,v)\ < [k/2\ + a for all u,v G V, where a = 1 if 
A: 4, 5, and a = 2 otherwise. Such a tree T can be constructed in linear time 

0(ji + m). 
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Abstract. We show that many non-MSOi NP-hard graph problems can 
be solved in polynomial time on clique-width and NLC-width bounded 
graphs nsing a very general and simple scheme. Our examples are parti- 
tion into cliques, partition into triangles, partition into complete bipar- 
tite subgraphs, partition into perfect matchings, partition into forests, 
cubic subgraph, Hamiltonian path, minimum maximal matching, and 
vertex/edge separation problems. 



1 Introduction 

Graphs of bounded clique-width are defined by three operations for vertex- 
labeled graphs, the vertex disjoint union G 0 of two graphs, the addition 
of edges r]ij{G) between vertices labeled by i and vertices labeled by j, and the 
relabeling pi^j{G) of vertices labeled by i into vertices labeled by j, see jS|. 
Graphs of bounded NLG-width are defined by two similar operations, the union 
G Xs H of two graphs that additionally creates all edges between vertices from 
G labeled by i and vertices from H labeled by j for every pair (i,j) C S, and the 
relabeling o/j(G) of vertices labeled by i into vertices labeled by R{i), see [TT] . 
The clique-width or NLG-width of a graph is the minimum number of labels 
needed to define it. Every graph of clique- width at most k has NLG-width at 
most k and every graph of NLG-width at most k has clique-width at most 2k, 
see |S]. 

Glique-width and NLG-width bounded graphs are especially interesting from 
an algorithmic point of view. A lot of NP-hard graph problems can be solved in 
polynomial time for graphs of bounded clique-width or bounded NLG-width if 
an expression for the graph is explicitly given. For example, all graph properties 
which are expressible in monadic second order logic with quantifications over 
vertices and vertex sets (MSOi-logic) are decidable in linear time on clique- width 
and NLG-width bounded graphs, see [^. The MSOi-logic has been extended by 
counting mechanisms which allow the expressibility of optimization problems 
concerning maximal or minimal vertex sets, see |^. All these graph problems 

* The work of the second author was supported by the German Research Association 
(DFG) grant WA 674/9-1. 
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expressible in extended MSOi-logic can also be solved in linear time on clique- 
width and NLC-width bounded graphs. 

If a graph G has clique-width or NLC-width at most k then the edge com- 
plement G has clique-width at most 2fc, see [^, and NLC-width at most k, see 
El- The set of all graphs of clique-width at most 2 or NLC-width 1 is the set of 
all labeled cographs. The clique-width and NLC-width of permutation graphs, 
interval graphs, grids and planar graphs are not bounded by some fixed integer 
k, see 1^. An arbitrary graph with n vertices has clique-width at most n — r, 
if 2'' < n — r, and NLC-width at most |"^], see |^. Every graph of tree-width 
at most k has clique-width at most 3 • 2^“^, see [^, and NLC-width at most 
2^“''^ — 1, see m- The graphs of clique-width at most 2 or NLC-width 1 do not 
have bounded tree- width. In [Zj, it is shown that every graph of clique-width or 
NLC-width at most k which does not contain the complete bipartite graph 
for some n > 1 as a subgraph has tree-width at most 3k{n — 1) — 1. The recog- 
nition problem for graphs of clique-width or NLC-width at most k is still open 
for fc > 4 and fc > 3, respectively. Clique- width of at most 3 is decidable in time 
0{n^m), where n is the number of vertices and m is the number of edges of the 
input graph, see jT]. NLC-width of at most 2 is decidable in time 0(n"^ log(n)), 
see 0. 

There are many NP-hard graph problems which are not expressible in ex- 
tended MSOi-logic, but which are, nevertheless, solvable in polynomial time on 
clique-width and NLC-width bounded graphs, see also dH. In this paper, we 
extend this problem list by partition into independent sets/cliques (chromatic 
number), partition into independent sets/cliques of bounded size (partition into 
triangles), partition into complete bipartite subgraphs, partition into perfect 
matchings, partition into forests, degree bounded subgraph problems (the cu- 
bic subgraph problem), the Hamiltonian path problem, the minimum maximal 
matching problem, and various vertex/edge separation problems. All these prob- 
lems are not expressible in extended MSOi-logic. The input of our algorithms 
is always a clique-width or NLC-width expression. Note that clique-width and 
NLC-width expressions can simply be transformed into each other. 

The aim of this paper is to illustrate the power of our simple scheme to design 
such polynomial time algorithms. We also demonstrate that it is sometimes 
much more convenient to use an NLC-width expression instead of a clique-width 
expression. This is especially the case for degree bounded subgraph problems 
and vertex/edge separation problems. The proofs of the correctness are more or 
less straight forward and are omitted in this version due to space limitations. 
We are not interested in bounding the constants in the running times of the 
algorithms which are sometimes more than double exponential in k (the clique- 
width of the graphs) . Such considerations seem to be only interesting if the time 
complexity can be reduced to 0(n'^), where c is independent of k. Up to now, 
no such algorithms are known for the examples of this paper. 
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2 Preliminaries 



Let [A:] := {1, . . . , fc} be the set of all integers between 1 and k. We work with 
finite undirected labeled graphs G = {V, E, lab), where 1^ is a finite set of vertices 
labeled by some mapping lab : V ^ [k] and E C {{m, n} | u,v € V, u ^ v} is a 
finite set of edges. For a vertex set U C V let lab([/) = {lab(tt) | u G U} and 
lab^(C/) be the set of all labels i such that U has at least two vertices labeled 
by i. A labeled graph J = {V ,E' ,\ah') is a subgraph of G if V' C V, E' C E 
and lab'(tt) = lab(u) for all u G V . J is an induced subgraph of G if additionally 
E' = {{u,t;} G if I u,v G V'}. The labeled graph consisting of a single vertex 
labeled by i G [A:] is denoted by 

The notion of clique-width for labeled graphs is defined by Courcelle and 
Olariu in [Sj. 

Definition 1 (Clique- width, |^). Let k be some positive integer. The class 
CWk of labeled graphs is recursively defined as follows. 

1. The single vertex graph for some i G [A:] is in CWk. 

2. Let G = {Vc, Ec, labc) G CWk and J = (Vj, Ej, labj) G CWk be two vertex 
disjoint labeled graphs. Then G© J := {V',E', lab') defined by V' := VgUVj, 
E' := Eg U Ej, and 



lab' {u) 



f labc(u) ifuGVc 
\ labj{u) if uGVj 



Vw G V' 



is in CWk. 

3. Let i,j G [A;] be two distinct integers and G = (V, E, lab) G CWk be a labeled 
graph then 

(a) rjij{G) := {V,E',lab) defined by 

E' := E U {{u,t)} I u,v gV, lab{u) = i, lab{v) = j} is in CWk and 

(b) pi^j{G) := {V,E,lab') defined by 



lab' {u) 



J lab{u) if lab{u) i 
\ j if lab{u) = i 



tjuGV 



is in CWk. 

The notion of NLC-width0 of labeled graphs is defined in m- 

Definition 2 (NLC-width, Illl L Let k be some positive integer. The class 
NLCk of labeled graphs is recursively defined as follows. 

1. The single vertex graph for some i G [A;] is in NLCk. 

2. Let G = {Vg, Eg, labc) G NLCk and J = (Vj,Ej,labj) G NLCk be two 
vertex disjoint labeled graphs and S C [A:]^ be a relation, then G Xg J := 



^ The abbreviation NLC results from the node label controlled embedding mechanism 
originally defined for graph grammars. 
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{V',E',lab') defined by V' := Vq U Vj, E' := Eq U Ej U {{m,d} | u G 
Vq, V G Vj, {laba{u),labj{v)) G 5}, and 



is in NLCk- 

3. Let G = {V,E, lab) G NLCk and R : [k] ^ [k] be a function, then oji(G) 

:= {V,E,lab') defined by lab'{u) := R{lab{u)), WuGV is in NLCk- 

The clique-width (NLC-width) of a labeled graph G is the least integer k 
such that G G CW^ (G G NLCfc, respectively). An expression X built with 
the operations Pi^j,r]ij for integers i,j G [k] is called a clique-width k- 

expression. An expression X built with the operations •i,xs,OR for i G [k], 
S C and i? : [fc] — 7 > [k] is called an NLC-width k-expression. The graph 
defined by expression X is denoted by val(A). 

The polynomial time algorithms we want to introduce for NP-hard graph 
problems on clique-width and NLC-width bounded graphs are based on the 
following scheme, see also El. Let 7T be a graph problem for labeled graphs. If 
there is a mapping E that maps every clique-width ^-expression X onto some 
abstract structure E{X) such that for all /c-expressions X,Y (1.) the size of 
F{X) is polynomially bounded in the size of X, (2.) the answer to LI for val(A) 
is computable in polynomial time from F{X), (3.) A(*i) is computable in time 
0(1), (4.) F{X © Y) is computable in polynomial time from F{X) and F{Y), 
and (5.) F{rjij{X)) and F{pi^j{X)) are computable in polynomial time from 
F{X) then for every clique- width fc-expression X, the answer to 77 for val(A) 
is computable in polynomial time from X. This works also for NLC-width k- 
expressions built with the operations X 5 , and instead of *i, ®, rjij, and 
Pi^j. Then F{X xrY) has to be computable in polynomial time from F{X) 
and F{Y), and 7"(o^(A)) has to be computable in polynomial time from F{X). 

In the next section, we frequently use the notion of a multi set, i.e., a set that 
may have several equal elements. For a multi set M with elements x\, . . . ,x-n 
write M. = {x\, . . . , a;„). There is no order on the elements of M.. The number 
how often an element x occurs in Ni is denoted by ip{A4, x). Two multi sets Aii 
and M 2 are equal if for each element x G M\ U M 2 , ip{Mi,x) = tp{M 2 ,x), 
otherwise they are called different. The empty multi set is denoted by (). 

For two distinct integers i,j let Ri^j : [k] — >• [fc] be defined by Ri^j{f) = t 
t i, and Ri^jff) = j \i t = i. For 77 : [7] — >■ [fc] and L C [7] let R{L) = 
{77(7) I 7 G L} and pi^j{L) = Ri^j{L). For L C [7]’’, r > 1, let 77(L) = 
{(77(7i),...,77(7t.)) I (7i,...,7t.) G 7/} and Pi^j i^L) — R^—^j (7/) . 

3 Examples 

Problem 1 (partition into independent sets (into cliques)). 

INSTANCE: Graph G = {V, E), positive integer r < \V\. 

QUESTION: Is there a partition of V into r disjoint sets Vi, . . . ,Vr such that 
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ViU ■ ■ ■ UVr = V and no set V*, 1 < t < r, has two vertices joined by an edge 
(every set Vt, 1 < t < r, induces a complete subgraph, respectively)? 

Let G = (V,£l,lab) = val(X) for some clique-width fc-expression X. For a 
disjoint partition of V into independent sets Vi , . . . , K- let M. be the multi set 
(lab(Vi), . . . ,lab(K)). Let F{X) be the set of all multi sets M for all disjoint 
partitions of vertex set V into independent sets. Then F{X) is polynomially 
bounded in the size of X, because F{X) has at most (|1/| -|- 1)^ multi sets 
each with at most \V\ nonempty subsets of [fc]. The following observations show 
that F{»i) is computable in time 0(1), F{X © Y) is computable in polynomial 
time from F{X) and F{Y), and F{riij{X)), and F{pi^j{X)) are computable in 
polynomial time from F(X). 

1. F(.,) = {({*})} 

2. Starting with set O = {()} x F{X) x F{Y), extend D by all triples that can 
be obtained from some triple {Ai, Ai' , Ai") G D hy removing a set L' from 
At' or a set L” from Ai” and inserting it into Ai, or by removing both sets 
and inserting L' U L” into Ai . 

D gets at most (|1?| + 1)^1^ triples and thus is computable in polynomial 
time. 

F{X®Y) = {Ai I (M, (),()) SO} 

3. F(r,,,,(A)) = {(Li,...,L,)eF(X) | {*, j} 2 L* for t = 1, . . . , r} 

4. F(p,^,(A)) = {(p,^,(Li),...,ft^,(L,)) I (Li,...,L,) eF(A)} 

There is a partition of the vertex set of val(X) into r independent sets if and 
only if there is some Ai G F{X) consisting of r label sets. A partition of G = 
{V,E,lah) into cliques corresponds to a partition of its edge complement G = 
(V, E, lab), E = {{u, n} I u,v G V, u V, {u, z;} ^ E}, into independent sets. A 
2fc-expression X for val(A) can easily be obtained from X in polynomial time, 
see |3]. This proves the following result. 

Theorem 1. Given a clique-width k-expression X for some fixed integer k, then 
the problems partition into independent sets and partition into cliques for graph 
val{X) are solvable in polynomial time in the size of X. 

The problem partition into independent sets (chromatic number) is also 
shown in | 10| to be solvable in polynomial time on clique- width bounded graphs. 
Due to its simplicity, we have used the chromatic number problem as our intro- 
ducing example. Note that our solution differs from that given in 1101 . 

Problem 2 (partition into independent sets (into cliques) of bounded size). 
INSTANCE: Graph G = (V,E), positive integer s < \V\. 

QUESTION: Is there a partition of V into disjoint sets Vi, . . . ,Vr of size s such 
that ViU ■ ■ ■ UVr = V and no set U) i t < r, has two vertices joined by an 
edge (every set Vj, 1 < t < r, induces a complete graph of G, respectively)? 

Let s be some fixed integer. For a disjoint partition of V into independent sets 
Ui, . . . , Ur of size < s let A4 be the multi set ((lab(Ui), |Ui|), . . . , (lab(Ur), |Ur|)). 
Let E{X) be the set of all multi sets Ai for all disjoint partitions of vertex set 
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V into independent sets of size < s. Then F{X) has at most (|y| + 

multi sets each with at most \V\ tuples (L,x), where L is a nonempty subset of 

[fc] and 1 < a; < s. 

F{»i), F{X © Y), F{rjij{X)), and F{pi^j{X)) are computable in the same 
way as for the problem partition into independent sets. The only difference worth 
to emphasize is the computation of F{X ©T) in the case where we remove both 
tuples, the tuple {L',x') from M' and the tuple {L'',x") from M" . In this case, 
we insert (L' U L" , x' + x"') into M. only if x' + x" < s. 

There is a partition of val(X) into independent sets of size s if and only if 
there is some M. G F(X) such that for all (L,x) G Xi, x = s. This and the 
edge complement property of clique-width bounded graphs proves the following 
theorem. 

Theorem 2 . Given a clique-width k-expression X for some fixed integer k, then 
the problems partition into independent sets of hounded size and partition into 
cliques of bounded size for graph val{X) are solvable in polynomial time. 

The problem partition into cliques of size three is also called partition into 
triangles. 

Problem 3 (partition into complete bipartite graphs). 

INSTANCE: Graph G = (V, E), positive integer r < \V\. 

QUESTION: Is there a partition of V into r disjoint sets Vi, . . . ,Vr such that 
Ui U • • • U Ur = V and each set U, I < t < r, induces a complete bipartite graph? 

Let Vip, Vip , . . . , Vrp, Ur, 2 be a disjoint partition of vertex set U into inde- 
pendent sets such that the subgraphs Gt induced by the vertices from Vtp U U,2 
are bipartite (not necessarily complete bipartite) for t = 1 , . . . , r. Let Xi be the 
multi set that has a triple {Li,L2,L-pj) for every pair (U,i,U,2) defined as fol- 
lows. Li = lab(Ut,i), L2 = lab(Ut,2), and Ljj is the set of all label pairs (i,j) such 
that there are two nonadjacent vertices u G U,i, v G Vtp labeled by lab(M) = i, 
lab(u) = j. Let F{X) be the set of all multi sets Xi defined as above for all 
disjoint partitions of vertex set U into bipartite graphs. Then F{X) has at most 

(|U| + i) 2'‘-2'‘-2*’ multi sets each with at most |U| triples (Li, L2, L-^), where Li 
and L2 are subsets of [k] and is a subset of [k] x [k] . 

F{»i), F{X © Y), F{rjij{X)), and F{pi^j{X)) are computable as follows. 

1. F(.,) = {((W,0,0),(0’,W,0))} 

2 . Starting with set D = {()} x F{X) x F{Y) extend D by all triples that 
can be obtained from some triple {Xi, Xi' , Xi") G D hy removing a triple 
(Lj, L2, L'—) from Xi' or a triple {L'(, L'), L'X) from Xi" and inserting it into 

or by removing both triples and inserting (L'^ UL", L'2UL'), L'—UL'XU 
L'l X L'f U L'( X L'2) into Xi. 

F{X(BY) = {Xi I {Xi,{),{))GD} 

3 . We say Xi G F(X) is feasible for operation pij if for every (Li, L2, ^ 

{i,j} 2 Xi and {i,j} % X2. 

If Xi is feasible for operation ryp then let rjij{Xi) be the multi set obtained 
from Xi by removing (i,j) and (j,i) from all sets Ljj for all {Li, L2, X^) G 

Xi. 
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F{riij{X)) = | M G F{X), M feasible for rjij} 

4. For operation pi^j and M G F{X), let pi^j{M) be the multi set obtained 
from M by substituting every triple (Li, L 2 , G Af by 

F{p,^j{X)) = {p,^,{M) I M G F{X)} 

There is a partition of val(X) into r complete bipartite graphs if and only if 
there is some M G F{X) consisting of r triples (Li,L2,0) such that Li and 
are nonempty. 

Theorem 3. Given a clique-width k-expression X for some fixed integer k, then 
the problem partition into complete bipartite graphs for val{X) is solvable in 
polynomial time. 



Problem 4 (partition into perfect matchings). 

INSTANCE: Graph G = (V, E), positive integer r < \V\. 

QUESTION: Is there a partition of V into r disjoint sets Vi, . . . , Pj. such that 
ViU ■■ - UVr = V and each set Vt, I <t < r, induces a perfect matching? 

A vertex set U' C U is called edge independent if the subgraph G' of G 
induced by V' does not contain two incident edges. Let Vi, ... ,Vr be a disjoint 
partition of vertex set V into edge independent sets. For such a partition of V 
let M be the multi set that has a 4-tuple (L>i, L> 2 , L_e, L[/) for every vertex 
set Vt, I < t < r, defined as follows. L>i = lab(Vt), L >2 = lab^(Vj), Le is the 
set of all label sets {i,j} such that the subgraph Gt of G induced by the vertices 
of Vt has two adjacent vertices labels by i and j, respectively, and is the 
set of all labels i such that Gt has a vertex labeled by i that is not incident to 
any edge of Gt. Let F{X) be the set of all multi sets M defined as above for 
all disjoint partitions of vertex set V into edge independent sets. Then F{X) 

has at most (|U| -I- ■ 2 ’^ gg^g most \V\ 4-tuples 

{Lyi, Ly 2 , Le, Lij), where L>i is a nonempty subset of [A:], L >2 and are 
subsets of [fc], and Le is a subset of {{i,j} \ i,j& [A:]}. 

F(*j), F{X © Y), F{riij{X)), and F{pi^j{X)) are computable as follows. 

1. F(.,) = {((W,0,0,W))} 

2. Starting with set D = {()} x F{X) x F{Y) extend D by all triples that 

can be obtained from some triple G D by removing a 4-tuple 

Ly 2 , L'e, L'jj) from M' or a 4-tuple {L'f^,L'f 2 ,L'f.,L'(j) from Xi” and 
inserting it into AI, or by removing both 4-tuples and inserting 

(L>i U L'fi, L>2 U L>2 U (L>i n L'f,f), L'e U L'e, L'y U L'{j) 

into M. 

F{X®Y) = {M I {M,{),{))€D} 

3. We say M G F{X) is feasible for operation pij if for every 4-tuple 
{L>x,L> 2 ,Le,Lu) G M the following holds. If {i,j} C L>i then {i,j} C 
L>i — L >2 and either {i,j} £ Le or i,j G Lfj. If Xi is feasible for operation 
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rjij then let r]ij{Ai) be the multi set obtained from Ai by the following 
modification of the 4 -tuples (L>i, L>2, Ly) G Ai. If i,j G L>i — L>2 
and i,j G Lu then remove i,j from Ljj and insert {i, j} into Le- 
F{rjij{X)) = {r]ij{Ai) \ Ai G F{X), Ai feasible for rjij} 

4 . For operation pi^j let pi^j (Ai) be the multi set obtained from Ai by the 
following modification of the 4 -tuples (L>i, L>2, Le, L^) G Ai. If i,j G F>i 
then insert j into L>2- Then relabel every label i into label j in every set 
L>i, L>2, Lu, and Le- 
F{p,^j{X)) = {p.^,{Ai) I Ai G F{X)} 

There is a partition of val(X) into r perfect matchings if and only if there 
is some Ai G F{X) consisting of r 4 -tuples {L>i, L>2, Le, Lu) such that Lu is 
empty. 

Theorem 4 . Given a clique-width k-expression X for some fixed integer k, then 
the problem partition into perfect matchings for val{X) is solvable in polynomial 
time. 

Problem 5 (partition into forests). 

INSTANCE: Graph G = (V, E), positive integer r < \V\. 

QUESTION: Is there a partition of V into r disjoint sets Vi,. . . ,Vr such that 
Vi LI ■ ■ ■ LI Vr = V and each set Vt, I <t < r, induces a forest? 

Let Vi,. . . ,Vr be a disjoint partition of the vertex set V such that the sub- 
graphs Gt of G induced by the vertices from Vt are forests for t = 1 , . . . ,r. For 
such a partition of V let Ai be the multi set that has a triple (L>i, L>2, Ft) for 
every vertex set Vt, I < t < r, defined as follows. L>i = lab(Vj), L>2 = lab^(Vj), 
and Lt is the set of all triples {K>i, K>2, K-j^) such that Gt has a maxi- 
mal connected component (a tree) T = (Vr,ET,labT) and AT>i = lab(Vr), 
K>2 = lab^(Vr), and is the set of all label sets {i,j} such that Vr has two 
nonadjacent vertices labeled by i and j, respectively. Let F{X) be the set of 
all multi sets Ai defined as above for all disjoint partitions of vertex set V into 
forests. 

The size of F{X) is polynomially bounded in the size of X, because F{X) 

has at most (|U| -I- i)(2'‘-i)'2'‘-2^ ^ ^ multi sets each with at most \V\ triples 
{L>i, L>2, Lt), where L>i is a nonempty subset of [fc], L>2 is a subset of [k], 
and Lt is a set of triples {K>i, K>2, Kj^) where K>i is a nonempty subset of 
[fc], K>2 is a subset of [fc], and Ai;gr is a subset of {{i,j} \ i,j& [fc]}. 

F{»i), F{X ©U), F{rnu{X)), and F{pi^j{X)) are computable as follows. 

1. F(-.) = {((W,0,{(W,0,0)}))} 

2 . Starting with set D = {()} x F{X) x F{Y) extend D by all triples that 
can be obtained from some triple {Ai, Ai' , Ai") G D by removing a triple 

L>2, L'rp) from Ai' or a, triple (L" L>2, L'f) from Ai" and inserting it 

into AI, or by removing both triples and inserting {L'^^ LlL'f^, L>2 U L> 2 U 
n L>i), L'rp U L'f) into Af. 

F{X(BY^ ={Ai I (AI,(),())Gi^} 
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3. We say A4 G F(X) is feasible for operation rjij if for every triple (L>i,L> 2 , 

Lt) € M, {i,j} g L >2 and for every triple {K>i,K> 2 ,K^) G ^ 

iCg, and if t G L>i —K>i then j ^ K> 2 , and if j G i>i —K>i then i ^ K> 2 - 
If Ai is feasible for operation rjij then let rjij(Ai) be the multi set obtained 
from Ai by the following modification of the triples (L>i, L> 2 , Lt) G Ai. 
Assume i G L>i — L> 2 , for j G L>i — L >2 change i and j. Then let 

(ATi,>i, ATi>2, • ■ • , Ks^> 2, Kgj^) G Lt 

be all triples such that i G Ali,>i and j G Kt^>i for t = 2, . . . , s. Remove 
these s triples from Lt and insert a new triple into Lt 

defined as follows. 

^>1= U 

l<t<S 

K> 2 = U Kt ^>2 u IJ n Art 2 ,>i, and 

l<t<S l<ti<t2^S 

= U ^t,E U U {{*',/} I G / G - {z,j} . 

l<t<S l<ti<t2^S 

F{r]ij{X)) = {r]ij{A4) \ A4 G F{X), Ai feasible for rjij} 

4. For operation pi^j let pi^j (Ai) be the multi set obtained from Ai by the 
following modification of the triples (L>i,L> 2 ,Lt) G Ai. If both labels i,j 
are in L>i or in AT>i of some triple (A">i, A1>2 j Afg) G Lt then insert j into 
L >2 or K> 2 , respectively. 

Then relabel every i into j in every set of (L>i, L> 2 , Lt). 

F{p,^j{X)) = {p.^,{Ai) I Ai G F{X)} 

There is a partition of val(A) into r forests if and only if there is some 
Ai G F{X) consisting of r triples. 

Theorem 5. Given a clique-width k-expression X for some fixed integer k, then 
the problem partition into forests for val{X) is solvable in polynomial time. 



Problem 6 (cubic subgraph). 

INSTANCE: Graph G = (E, E). 

QUESTION: Is there a nonempty subset E' C E such that in graph {V,E') 
every vertex has either degree 3 or degree 0? 

We illustrate this example with an NLC-width /c-expression instead of a 
clique-width /c-expression. For some fixed integer s and a subset of edges E' C E 
such that in G' = {V, E' , lab) all vertices have degree < s, let Ai be the following 
multi set. Ai has for every vertex u G V a, tuple (lab(u), deg(rt)), where deg(u) is 
the vertex degree of u in G'. Let F{X) be the set of all multi sets Ai for integer 
s and all such subsets E'. Then F{X) has at most (|U| -I- l)^'0+i) multi sets 
each with \V\ tuples {I, d), where I is a label from [k] and d is an integer between 
0 and s. 

E{»i), F{X Xs Y), and F(o^(A)) are computable as follows. 
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1. F(.,) = {((*, 0))} 

2. Starting with set D = {()} x F{X) x F{Y) extend D by all triples that can 

be obtained from some triple G D hy 

(a) removing a tuple {I', d!) from Xi' or (/", d") from M." and inserting it 
into M. or by 

(b) removing a tuple (Z', d') from Xi' and choosing at most r, 1 < r < s — d', 
tuples (Z", d'l ), . . . , (Z", d”) from M." such that (Z', Z") G S and d” < s for 
t= 1, . . . , r, and inserting (Z', d' + r) into M and changing every (Z", d") 
into {l't,d” + 1) without removing them from Ai”. 

F{XxsY) = {M I (M,(),())Gi^} 

3. F{on{X)) = {{{R{h),di),...,mt),dt)) I {{h,d,),...,{k,dt))GM} 

There is a subgraph of val(X) with at least one edge such that all vertices 
have degree 3 or 0 if and only if there is some Ai G F{X) such that for all 
(Z, d) G Ai, d = 3 or d = 0 and there is at least one (Z, d) G Ai with d= 3. 

Theorem 6. Given an NLC k-expression X for some fixed integer k, then the 
cubic subgraph problem for val{X) is solvable in polynomial time. 



Problem 7 (Flamiltonian path). 

INSTANCE: Graph G = {V, E). 

QUESTION: Is there a simple path of length |U| in G1 

For a subset of edges E' C E such that subgraph G' = {V,E',lab) contains 
no cycles and every vertex in G' has degree at most 2, let Ai be the following 
multi set. A4 has a multi set (lab(i;i), lab(wr)) for every path p = vi,...,Vr, 
r > 1, in G', where vi,Vr have degree < 1 in G'. Let F{X) be the set of all multi 
sets Ai for all such subsets E' C E. Then F{X) has at most (|U| + 1)^ multi 
sets each with at most \V\ multi sets of size two. 

F(*i), F{X ©E), F{riij{X)), and E{pi^j{X)) are computable as follows. 

1. F(.,) = {((^,^))} 

2. F{X © E) = {Ai' U Ai" I Ai' G E{X), Ai" G E(E)} 

3. F{rjij{X)) can be computed from F{X) as follows. Starting with set D = 

F{X), extend D by all multi sets that can be obtained from a multi set Ai 
of D by removing two multi sets (* 2 , J 2 ), where i = i\ and j = j' 2 , 

and inserting the new multi set ( 12 , ji). The resulting set D is Efqi j{X)). 

4. = {((p.^,(M)) I MgM)\ AiG E{X)} 

Graph val(A) has a Hamiltonian path if and only if E{X) has a multi set 
Ai consisting of exactly one multi set. 

Theorem 7. Given a clique-width k-expression X for some fixed integer k, then 
the Hamiltonian path problem for val{X) is computable in polynomial time. 

A simple modification leads to a polynomial time solution for the Hamiltonian 
circuit problem as in m- 
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Problem 8 (minimum maximal matching). 

INSTANCE: Graph G = (V, E), positive integer r < \E\. 

QUESTION: Is there a subset E' C E with \E'\ < r such that \E'\ is a maximal 
matching, i.e. no two edges in E' share a common vertex and every edge in 
E — E' shares a common vertex with some edge in E'l 

For a subset of edges E' C E and a disjoint partition of V into three sets 
) ^2 , ^3 such that Vi is an independent set and E' is a perfect matching for 
the vertices of V2, we define a triple (lab(Ui), \E'\, (lab(wi), . . . ,lab(Ms))), where 
ha = {ui, . . . , Us}. Let E{X) be the set of all triples defined as above for all such 
edge sets and partitions. 

Then E{X) has at most 2* • • (|U| -|- 1)^ triples of size at most A:-|- 1 -h \V\. 

E{»i), F{X ©U), F{riij{X)), and F{pi^j{X)) are computable as follows. 

1. F(.,) = {(0,0, (*)),({*}, 0,0)} 

2. F{X®Y) = + (L,t,M) G F{X), 

F{Y)} 

3. F{riij{X)) can be computed from F{X) as follows. Starting with set D = 

F{X), remove all triples (L,t,Xi) from D for which {i,j} Q L, and then 

extend D by all triples (L,t+ l,Ad — ({0,{j})) for all triples in 

D, where ({z}, (jj) C M. The resulting set D is F{rn j{X)). 

4. F{p,^j{X)) = {{p,^j{L),t,p,^,{M)) I {L,t,M)G F{X)} 

There is a subset E' C E with \E'\ = r such that \E'\ is a maximal matching 
in G if and only if there is some triple {L, r, ()) € F{X). 

Theorem 8. Given a clique-width k-expression X for some fixed integer k, then 
the minimum maximal matching problem for val{X) is computable in polynomial 
time. 

Let us finally consider some vertex and edge separation problems. Let F{X) 
be the set of all 5-tuples (lab(Vi), \Vi\, |V2|,lab(V3), IV3I) for all disjoint partitions 
of vertex set V into three sets Vi , TO) L3 such that G has no edge between a vertex 
from Vi and a vertex from V3. Then F{X) has at most 2^ • (|U| + 1) • (|U| + 
1) • 2^= • (|U| + 1) 5-tuples. E(.0 = {({t}, 1, 0, 0, 0), (0, 0, 1, 0, 0), (0, 0, 0, (zj, 1)}. 
F{X XsY) is the set of all 5-tuples (Li,xi,X 2 ,Ls,X 3 ) obtained from two 5- 
tuples {L[,x'i,X 2 ,L'^,x'^) G F{X) and {L'( , x” , x'f , L'f , x'f) G F(Y) such that 
Li = L[ U L'(, xi = x[ + x'(, X 2 = x '2 -\- x'f, L 3 = L'^U L”, x^ = x'^-\- x'f, if 
there is no pair (z,j) G S such that i G L( and j G L'f or z G Lg and j G L'{. 
F{or{X)) = |(i?(Li),a;i,X2,i?(L3),X3) | (Li, xi, 0:2, Lg, 2:3) G E(A)|. 

The vertex separation problem whether G = val(A) has a vertex set V2 of 
size < r such that G without the vertices of V2 can be partitioned into two 
disjoint sets Ui,V3 of size < a|U| such that there is no edge between vertices 
from V\ and vertices from V3, for some given r and a, can now be solved in 
polynomial time. 

Let F{X) be the set of all triples 

((lab(zzi), . . . ,lab(zzr)),a;, (lab(zzr+i), . . . ,lab(zz„))) 
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for all disjoint partitions of = {u \, . . . , u„} into Vi = {ui , . . . , Ur} and V 2 = 
{ur+i , . . . , Un} such that X is the number of edges between the vertices from Vi 
and V 2 . Then F{X) has at most (|F| + 1)* • {\E\ + 1) • (|t^| + 1)^ triples. 

F{»i) = {((i),0, 0), (0,0, (i))}. F{XxsY) is the set of all triples 
obtained from two triples G F{X) and (A^", x", G ^0^) such 

that Ml = M'l U M'[, M 2 = M '2 U M' 2 , and 

x = x'Fx"f ^ i){M\,i) ■i){M2-,i) + ^ tp{M'2,i) ■ ip{M'l,j). 

{i,MS {i,MS 

F{oji{X)) = {{R{Mi),x,R{M2)) I (Mi,xM2) G F(X)}. 

The edge separation problem whether there is a partition of the vertex set 
of G = val(X) into two disjoint sets Vi, V 2 of size < a\V\ such that there are at 
most r edges between vertices from Vi and vertices from V 2 , for some given r 
and a, can now be solved in polynomial time. 
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Abstract. We establish two results for (g, /)— factorizations: 

1. every {mg + m — 1, mf — m + 1)— graph with 9k/4 < g < f has a 
(g, /)— factorization orthogonal to any set of k given edge disjoint 
m— subgraphs of G; 

2. every (0, mf — m+ 1)— graph G with / > fe + 2 has a (0, /)— factori- 
zation orthogonal to any set of k given vertex disjoint m— subgraphs 
of G. Polynomial-time algorithms to find the claimed orthogonal 
factorizations under the above conditions are presented. 



1 Introduction 

Consider a simple graph G = {V,E). Let da{x) be the degree oi x € V. For 
any two integer-valued functions g and / defined on V such that Vx S U : 
g{x) < f{x), a (g,/)— factor of G is a spanning subgraph F of G satisfying 
g{x) < drix) < f{x) for all x S V{F). In particular, if G is a (g, /)— factor itself, 
then G is called a {g, /)— graph. A (g, /)— factorization T — {Fi, F2, . . . , Fm} of 
G is a partition of F into edge disjoint (g, /)— factors Fi, F2, ..., F^,. Let F be 
a subgraph of G with m edges (called an m— subgraph of G). A factorization 
F = {Fi, F2, ..., Fm} of G is orthogonal to H if |F(F) fl F{Fi) \ = 1, 1 < f < m. 

The factorization problem can be viewed as a generalization of the edge col- 
oring problem. Imposing the condition orthogonal to a given subgraph (or k 
subgraphs) introduces to the study an appealing feature for application areas. 
The study of orthogonal factorization problems was first proposed by Alspach 
et al.[I], to formulate practical problems that found applications in combina- 
torial design, network design, circuit layout, etc.. The decision problem of the 
orthogonal factorization problem is, in general, NP-complete[il 

Early studies in this area have been focused on the factorization orthogonal 
to one subgraph. Recently, Lam, et al.jB], studied the more general problem of 

* Research is partially supported by a grant from the Research Grants Council of 
Hong Kong SAR (CityU 1074/00E) and a grant from CityU of Hong Kong (Project 
No.7001215). 

^ Reduction from the edge coloring problem to it can be done by setting H as the set 
of edges incident to a maximum degree node, and setting g to be all zeros and / to 
be all ones. 
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finding factorizations orthogonal to each of k subgraphs. More precisely, they 
consider the following problem: given subgraphs Hi,H 2 , ■■■, of G, does there 
exist a factorization orthogonal to each Hi,l < i < kl They proved that, for any 
vertex-disjoint m— subgraphs iJi, H 2 , ..., Hk of an {mg+m—1, m/-TO-|-l)— graph 
G, there exists a (g, /)— factorization of G orthogonal to every Hi,l < i < k, 
where k < g{x) < f{x) for every x £ V . The requirement of vertex-disjoint 
subgraphs is rather stringent in practical applications. As an open problem, Lam, 
et ah, asked that, if Hi, H 2 , ■■■, Hi^ are mutually edge-disjoint m— subgraphs of 
G, will the above conclusion still hold? In this paper, we answer the question in 
the affirmative under the condition 9fc/4 < g < /. 

Another question that left open in their work is when g is allowed to be 
all zeros. We also study this for (0,m/ — m + 1)— graphs and prove that: for 
any vertex-disjoint m— subgraphs Hi, H 2 , ■■■, Hk of a (0, mf — m + 1)— graph G, 
there exists a (0, /)— factorization of G orthogonal to each Hi, 1 < i < k, where 
k + 2 < f{x) for all x €V. 

In Section 2, we introduce some preliminary results necessary for our study. 
In Section 3, we present our work on factorizations orthogonal to k edge disjoint 
graphs. In Section 4, we prove our extension for (g, /)-factorization with g = 0 
(for k vertex disjoint graphs). In Section 5, we conclude our paper with discus- 
sion on open problems and on polynomial-time algorithms for finding the above 
{g, /) —factorizations. 



2 Preliminaries 



For S C V, let E{S) = {xy € E{G) : x,y G S}. Suppose S,T CV with SdT = 0, 
and Ei,E 2 C E with Ei D E 2 = 0. Let Eg{S,T) = {(s,t) G E : s G S,t G T} 
and eG{S,T) = \Eg{S,T)\. Let D = V\{SUT). 

Let 

E'i = EinE{S), E'{ = EinE{S,D)-, 

E'2 = E2nE{T), A" = A2nA(r,G); (1) 

Hi = G[E[U E”], H 2 = G[A' U A"]. 

And let 

aG{S,T-Ei,E2) =2\E'i\ + \E'l\= ^ 

x^S 

Pg{S,T-,Ei,E2) = 2|A'| + |A"| = E cIhAx), (2) 

xGT 

AG{S,T;Ei,E 2 ) = aG{S,T; Ei, E2) + ( 3 g{S,T; ^ 1 ,^ 2 ). 



For simplicity, we use a, (3, A for aG{S,T-, Ei,E 2 ), Pg{S,T; Ei, E 2 ) and Ag{S, 
T-, Ei,E2). 

Moreover, for any function u and any set S, we denote ^ u(x) by u(S). 

xGS 




(g, /)— Factorizations Orthogonal to k Subgraphs 131 



We define auxiliary functions u, w and p, q on V. For each x € V, let: 

u(x) = max{g(x), do(x) — (m — l)f(x) + m — 2}, 

w(x) = min{ f(x), do (x) - (m - l)g(x) -m + 2}. 

p(x) = max{0, da (x) — (m — l)f(x) + m — 2}, ' ' 

q(x) =min{f(x),da(x)}. 

From the definitions of u, w and p, q it is easy to see: 

Lemma 1. For any integer m > 2, and for each x S V : 

g{x) < u{x) < w{x) < f{x). (4) 

Lemma 2. For any integer m > 2, and for each x S V : 

0 <p{x) < q{x) < f{x). (5) 



For all X G V, define: 

f2i{x) = da{x) /m — u{x), fi 2 {x) = w{x) — da{x)/m] (6) 

Ai{x) = dG{x)/m-p{x), A 2 {x) = q{x) - da{x)/m. (7) 



Lemma 3. For any disjoint subsets S,T ofV, we have(l^): 



6a{S,T;u,w) = f2i{T) + 1?2(5') + dG-s{T){m - 1) /m + da-riS) /m. (8) 

Sa{S, T;p, q) = Z\i(T) + Z\2(5) + da-s{T){m - l)/m + da-T{S)/m. (9) 

We should also make use of the following lemma which gives a sufficient and 
necessary condition for a graph to have a {g, /)— factor that includes some given 
edges and excludes some other given edges |B]. 

Lemma 4. Let G = (V,E) be a graph, g and f be integer-valued functions 
defined on V such that for each x G V, 0 < g{x) < f{x) < da{x). Let E\ and E 2 
be two disjoint subsets of E. Then G has a (g, f)— factor F such that E\ C E{F) 
andE 2 C\E{F) ^ % if and only if 5a{S,T-, g, f) = da{T)-eG{S,T)-g{T)-Gf{S) > 
A for all disjoint subsets S and T ofV. 

3 Factorization Orthogonal to k Edge Disjoint Sets 

Theorem 1. Let G be an {mg + m — 1, mf — m + 1)— graph, where g and f be 
integer-valued functions defined on V satisfying 9k/A < g < f. Let F[i,H 2 , Hk 
be edge disjoint m— subgraphs of G. Then there exists a {g, f)— factorization of 
G orthogonal to each Hi,l < i < k. 
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Proof (of Theoreml^. Choose UiVi G E{Hi),l < i < k. Let Ei = {uiVi : 1 < 

/c 

i < k}, i?2 = U \ Ex. Then \Ex\ = k, |i? 2 | = {m — l)k. For any two 

i—1 

disjoint subsets S and T of V, E'x,E”, E' 2 , E'f, a, /3 and A are defined as before. 
According to the definitions of a and /?, we have: 

a < min{2fc, /3 < min{2(m — l)fc, (m — l)fc|T|}. (10) 



Define functions and flx^^i on V as before. We have the following claim. 
Claim 1. For every x £V and m > 2, we have 



f2i{x) > 



1/m : if u{x) > g{x) and dcix) = mf{x) — m + 1, 

1 — 1/m, : else; 



( 11 ) 



J 1/m, : ifw{x)<f{x)anddG{x)=mg{x) + m-l, 

-\l-l/m, : else. 

Proof (of Claim[^. If u{x) > g{x) and dcix) = mf{x) — m + 1, then 

f2i{x) = dG{x)/m - u(x) = dG(x)/m - dG(x) + (m - l)f(x) - (m - 2) , , 

= (mf(x) — m + 1)(1 — m)/m + (m — l)f(x) — m + 2 = 1/m. ^ 

If u(x) > g(x) and dG(x) < mf{x) — m, then 

I7i(x) = dG{x)/m - u{x) = dG{x)/m - dG{x) + (m - l)f(x) - (m - 2) , . 

> (m/(x) — m)(l — m)/m + (m — l)/(a;) — m + 2 = 1. ' ' 

If u{x) = g{x), then 

f2i{x) = dG{x)/m — g{x) > {mg{x) + m — 1) /m — g{x) = 1 — 1/m. (15) 

The second inequality can be proven similarly. □ 



Lemma 5. Let G be an {mg + m — 1, mf — m + 1)— graph, where m > 2. Then 
G has a (u, w)— factor F such that Ex C E{F) and E 2 H E{F) = 0. 

Proof (of Lemma\^. According to Lemma |4] it is enough to show for all disjoint 
subsets S, T of V, 6g{S,T; u,w) > a + /3. 

If S' = 0, according to Lemma [3] and Claim [Jl we have 

Sg{S,T; u, w) = dG{T){m - l)/m + J7i(T) 

> {mg{T) + (m — l)|r|)(m — l)/m + |T|/m (16) 

> (m — l)fc|r| > (3 = a + /3. 

Similarly when T = 0, we have 
5g{S,T-,u,w) 

= dG{S)/m+ D 2 {S) > {mg{S) + (m — l)|S|)/m + |S|/m 
> k\S\ > a = a + (3. 



(17) 
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Therefore, we should assume S and T 7 ^ 0. 
Let 



So = {x ^ S \ w{x) < f{x) and dc{x) = mg{x) + m — 1}, 
Tq = {x ^ T \ u{x) > g{x) and dc{x) = mf{x) — m + 1}. 



(18) 



Let Si = S\So a.ndTi=T\ Tq. 

If [S’! > 2 and |T| > 2, then choose x GT such that |i ?2 D) \ is as small 

as possible and choose y G S such that \Ei r\E{y, D)\ is as small as possible. By 
Lemma [3] and Claim [Jl we have 

Sg{S, T; u,w) 

= Ci(T) + 172 ( 5 ') + dG-s{T){m - l)/m + dG-T{S)/m 

> (|To| + (m - 1 )|Ti|)/to+ (|5o| + (m - l)|5i|)/m 
+ dG-s{T){m - l)/m + dG-T{S)/m 

= ((m - 2)(|5i| + |Ti|))/m + (|5| + dG-s{T))/m 
+ (|T| + dG-T{S))/m + dG-s{T){m - 2)/m 

> ((m — 2)(|5i| + |Ti|))/m + {mg{x) + m— 1 + /3 — (m — l)kl2)lm 
+ {mg{y) + to — 1 + a — k/2) jm + (3{m — 2)lm 

> ((to — 2)(|5i| + |Ti|))/to + (4to/c + 2m — 2)/m + (a + (to — 1)/3)/to 

> ((to — 2)(|5ij + |Tij))/TO + (2 (to — l)k + (to — l)2fc + dk + 2m — 2)/m 

+ (a + (to — 1)/3)/to 

> {ma + to/3) /to = a + /3. 

(19) 

If |5| = 1 or \T\ = 1, we choose x G T, y G S. According to Lemma El and 
Claim [U we have 

Sg{S, T; u,w) 

= ^i{T) + 172(5) + dG_s(T)(TO - 1 )/to + dG-T{S)/m 

> ((to - 2)(|5i| + |Ti|))/to + {mg{x) + to-1 + /3-(to- l)fc)/TO 

+ (to5(j/) + TO — 1 + a — /c)/to + /3(to — 2)/to ' 

> {7mkj2 + 2m — 2)/m + (a + (to — 1)/3)/to 

= (2(to — l)fc + 5mkj2 + 2m — 2)/m + (a + (to — 1)/3)/to. 

If |5| = 1, then a < fc|5| = k, and subsequently we have 



Sg{S,T-,u, w) > (to/3 + ma)/m + {mk/2 + k + 2m — 2)/m 
> cr -t“ /3. 



(21) 



If \T\ = 1, then /3 < (to — l)A;|r| = (to — l)k, and subsequently we have 

Sg{S,T', u, w) > {ma + to/3) /to + {mk/m + k + 2m — 2)!m , . 

>a + /3. 

Therefore, Lemma El is proven. □ 

Now we prove Theorem [TJ 

We apply induction on to. The claim is trivial when to = 1. Suppose the 
theorem is true for to — 1, where to > 2. We will prove the theorem is also true 
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for m. According to Lemma [H G has a (u,w)— factor F such that Ei C E{F) 
and E 2 n E{F) = 0. From Lemma [T] F is also a ((;,/)— factor. In addition, for 
all X G V, we have: 

d,G-F{x) 

= dcix) — dpix) > dcix) — w(x) 

> dclx) — dclx) + (m — 1)5(2:) + (m — 2) = (m — 1)5(2:) + (m — 1) — 1, 
da-F{x) 

= dcix) — dF{x) < dcix) — u{x) 

< dclx) — dclx) + (m — l)f{x) — (m — 2) = (m — l)/(2:) — (m — 1) + 1. 

(23) 

Hence, G — F is an {{m — 1)5 + (to — 1) — 1, (to — 1)/ — (m — 1) + 1)— graph. Let 
H- = Hi\{uiVi}, then F[[, Fl!^, Fl'i. are (to— 1)— subgraphs of G— F. According 
to induction hypothesis, G— F has a (5, /)— factorization F' = {Fi, F2, ..., F^-i} 
orthogonal to F2, ..., F(,. Then G has a (5, /)— factorization F = F' U {F} 
orthogonal to Hk- Therefore, the theorem is proven. □ 

Substitute 5 — 1 and / + 1 for 5 and / in Theorem [T] we derive the following 
corollary. 

Corollary 1. Let G be an {mg — 1,to/ + 1)— graph, where 9fc/4 + 1 < 5 < /. 
Let Hi,H 2 , be edge disjoint m— subgraphs of G. Then G has a (5 — 1,/ + 

l)—faetorization orthogonal to eaeh Fli,l <i<k. 

4 Some Extension to Factorization Orthogonal 
to k Vertex Disjoint Sets 

Theorem 2. Let k be a positive integer, G be a (0, mf — m + 1)— graph, where / 
is an integer-valued function defined on V satisfying for each x G V, f{x) > k-\-2. 
Let Fi, F2, ..., iLfe be vertex disjoint m— subgraphs of G. Then there exists a 
{0, f)— factorization of G orthogonal to each Fj,l <i<k. 

Proof (of Theorem[B). We apply induction on to. When to = 1, the theorem is 
trivial. Assume that the theorem is true for to — 1, where to > 2. We will prove 
it is also true for to. 

Choose UiVi G Hi,l < i < k. Let 



k 

Fi = {um,l< i < k}, E 2 = (IJ F(F,)) \Fi. (24) 

i=l 

We have |Fi| = k and IF2 I = (to — l)fc. For two disjoint subsets S,T of V, E[, 
F", F2, F2 , a, [3 and A are defined as before. It is easy to see from the definitions 
of a and (3 that: 

a < min{2fc, |s |}, (3 < min{2(TO — l)fc, (to — 1)|F|}. (25) 

Next, we choose S,T satisfying the following conditions 1, 2 and 3: 
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1. 6c{S,T;p, q) — Ac{S,T; El, E 2 ) is as small as possible, where p, q are defined 
as before; 

2. \T\ is as small as possible subject to 1; 

3. I ^ I is as small as possible subject to 1 and 2. 



Claim 2. IfT^ 0, then for every x € T, p{x) > 1 (and p{x) = dcix) — {{m — 
l)/(a:) — (m — 1 ) + 1 ) and daix) > (m — l)f{x) — m + 3). 

Proof (of Claim[^l. Otherwise, T* = {x G T;p{x) = 0} 7 ^ 0. Let Tq = T\T*. 
Then 

Sg{S, T-p, q) = dciT) - ec(5, T) - p{T) + q{S) 

= 6G{S,To;p,q) + dc-siT*) - p{T*) (26) 

= 6G{S,To;p,q)+dG-s{T*). 

On the other hand, 

Ag(S,T;Ei,E2) < Ag{S,To;Ei,E2) + Pg(S,T*;Ei,E2). (27) 

Therefore, 

6G{S,T;p,q)-AG{S,T;Ei,E2) . , 

> SG{S,To;p,q) - Ag{S,To;Ei,E 2 ) + dG-s{T*) - (dG{S,T*;Ei,E 2 ). ^ 

Notice that dG-s{T*) > Pg{S,T*; Ei, E 2 ), so we have 

(5g (^, T; p, q) - Ag(S,T; El, E 2 )> Sg{S, Tq; p, q) - Ag{S,To-, Ei, E 2 ), (29) 

a contradiction to the choice of T. Therefore, T* = 0 □ 



Claim 3. If S ^ 0, then for every x G S, q{x) < dG{x) — 1 (and q{x) = f{x) 
and dG{x) > f{x) + !)■ 

The proof of Claim is similar to that of Claim and hence omitted. 

For all x gV , Let Ai{x) and A 2 {x) be defined as in the section 2. 

Claim 4. If S ^ % and T 7 ^ 0, then for all x G T, 



Ai{x) > 



1/rn, : if dG(x) = mf{x) — m + 1, 

1 , : else; 



for all X G S, 



A2{x) > 



1 — 1 /m, 

1 , 



if dG{x) = mf{x) -m+1, 
else. 



(30) 



(31) 



Proof (of ClaimW. According to Claim |2] for every x G T, p(x) = dG(x) — 
((m - l)f{x) - (m - 1) + 1). Therefore, 



Z\i(a;) = dG{x)/m- p{x) 

= dG{x)/m — dG{x) + (m — l)f{x) — (m — 2 ) 
= fiG(a;)(l — m)/m + (m — l)/(a;) — (m — 2 ). 



( 32 ) 
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If dc{x) = mf{x) — m + 1, then 

Ai{x) = {mf{x) — m + 1)(1 — m)jm + (m — l)f{x) — (m — 2) = 1/m. (33) 

If (1g{x) < mf{x) — m, then 

A\{x) > {mf{x) — m)(l — m)/m + (m — l)/(a;) — (to — 2) = I. (34) 
According to Claim |3] for all x £ S, q{x) = f{x). Therefore, 

A2(x) = q{x) - d,G{x)/m = f{x) - dG{x)/m. (35) 

If dG{x) = mf{x) — TO + I, then 

A 2 {x) = f{x) — (mf{x) — TO + 1)/to = 1 — I/to, (36) 

else, 

A 2 {x) > f{x) — (mf{x) — m)/m = 1. (37) 

□ 

In order to prove Theorem [21 we will first prove that G has a (p, g)— factor 
F such that Ei C E{F) and E 2 fl E{F) = 0. According to Lemma [Hand the 
choice of S, T, it suffices to prove that 6g{S,T;p, q) > a + p. 

If S' = 0 and T ^ 0, then a = 0. According to Claim (21 

SG{S,T;p,q) = dG{T) - p{T) 

= dG{T) - {dG{T) - ((to - 1)/(T) - (to - 2)|T|)) 

= (to-1)/(T)-(to-2)|T| (38) 

> (TO-l)(fc + 2)|T|-(TO-2)|r| 

> (to — 1)|T| > /3 = /3 + a. 

If T = 0 and S ^ 0, then /3 = 0. According to Claim O 

<5g(S, T;p, q) = q{S) = f{S) > (fc + 2) |S| > |S| > a = a + /3. (39) 

If S = 0 and T = 0, then a = 0, P = 0. And subsequently, SG{S,T;p,q) = 
0 > a + p. We should therefore assume that S ^ 0 and T 0 in the rest of 
proof of Theorem [2l 

According to Lemma |3] 

SG(S,T;p,q) = Ai{T) + Aa(S) + dG-s{T){m - l)/m + dG-T{S)/m. (40) 



Let 



To = {x £ T\dG{x) = mf(x) -m+l},Ti=T\ Tq] 
So = {x £ S\dG{x) = mf{x) — TO + 1}, Si = S \ Sq. 



(41) 
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We have several cases to discuss: 

Case 1. To — 0. Now Ti = T. Notice that dc-siT) > /3 and that dc-riS) > a. 
According to Claim |4] 

SG{S,T-,p,q) = Ai{T) + A 2 {S) + dG-s{T){m - l)/m + dG-T{S)/m 

> \T\m/m + |S'|(m — l)/m + /3(m — l)/m + a/m , . 

> (/3 + |T|)/m + a(m — l)/m + /3(m — l)/m + a/m 

t> O: + /3. 



Case 2. To ^ 0 and |Ti| > k. According to Claim [21 Claim Eland Claim E] 
5g{S, T\p, q) 

= Ai{T) + ^ 2 ( 6 ') + dG-s{T){m - l)/m + dG-T{S)/m 
> (|To| + m|ri|)/m + IS'Kto - l)/m + dG-s{T){m - l)/m + dG-T{S)/m 



= 


ITiKto 


— 1) /to 


+ ( r + dG-T 


\S))/m 


+ ( '5' + dG-S 


(T))/to 


+ 


^ (to - 


-2)/m- 


f dG-s{T){m - 


- 2)/m 






> 


|Ti|(to 


— 1) /to 


+ {dG{x) + a ■ 


— 1)/to 


+ (rfc(y) + /3- 


- (to — 1)) /to 


+ 


a(m — 


2)/to + 


(3{m — 2)/m 








> 


fc(m — 


1)/to + 


{f{x) + 1 + 0 


— 1)/to 


+ {mf{y) - TO 


+ 1 + /3 - (to 


+ 


a{m — 


2)/to + 


f3{m — 2)/to 








> 


k(m — 


1)/to + 


{k + 2)/to + (: 


m{k + 2) — 2to + 2)/to 


+ 


a{m — 


l)/m + 


/3(to — 1)/to 










2mk/m + 4/to 


+ a{m — 1) / m + j3{m 


- — 1)/to 




> 


(too + 


m[3)/m 


— Q: + /?, 









where x € S, y € Tq. 

Case 3. Tq ^ 0, |Ti| < fc — 1 and |T| < fc.In this case, 

5g{S, T;p,q) 

= dG{T)-eG(S,T)-p{T) + q{S) 

= dG{T) - {dG{T) - ((m - 1)/(T) - (m - 2)|T|)) - eG(5, T) + f{S) (44) 

> (m - l)(fc + 2)|T| - (to - 2)|T| - |5||T| + (fc + 2)|5| 

> (to- 1)|T| + |5| + (A:-|T|)|A| >a + /3, 



where the second equality comes from Claim [2l 

Case 4- To yf 0, |Ti| < A: — 1 and |T| > fc + 1. We have |To| > 2 in this case. 
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If Sq = 0, then = S. Choose x,y G Tq, z G S. According to Claim H] 
Claim and Claim [d] 

Sg{S,T;p, q) 

= Ai(T) + Z\ 2 ( 5 ') + dG-s{T){m - l)/m + dG-T{S)/m 

> (|T| + (m — l)|Ti|)/m + |S'|m/m + dG-s{T){m — l)/m + dG-T{S)/m 
= |Ti|(m - l)/m + (|T| + dG-T{S))/m + (2|^| + dG-s{T))/m 

+ |>S'|(to — 2)/m + dG-s{T){m — 2)/m 

> |T'i|(m — l)/m + {dG{z) + a — l)/m + {dG{x) + dG{y) + P — 2 (m — l))/m 
+ {a + P){m — 2)/m 

> |Ti|(m - l)/m + if{z) + a)/m 

+ {m{f{x) + f{y)) — 2{m — 1) + /3 — 2m + 2)/m + (a + /3)(m — 2)/m 

> (fc + 2)/m + (2m(fc + 2) — 2m + 2 — 2m + 2)/m + (a + /3)(m — l)/m 

> (2(m — l)k + 2k + k + 6)/m + (a + P)(m — l)/m 

> (ma + m/3) /m = a + /3. 

(45) 

If So 0, then there exists a z G Sq such that dG(z) = mf(z) — m + 1. 
Choose an X G Tq. According to Claim |2] Claim |3] and Claim |4] and similar as 
the discussion above, we have 

SG(S,T;p,q) = Ai(T) + A 2 {S) + dG-s{T){m - l)/m + dG_T(S')/m 

> {mf{z) — m+l + a — l)/m + {mf{x) — m+l + /3 — m+l)/m 

+ a{m — 2)/m + P{m — 2)/m (46) 

> {m{k + 2) — m)lm + {m{k + 2) — 2m + 2)/m + (a + /3)(m — l)/m 

> o A /3. 

From the above discussion, for the chosen S, T, we always have 6g {S, T;p,q) 
> a + p. According to Lemma G has a {p, 9 )— factor F such that Ei C E{F) 
and E 2 n E{F) = 0. From Lemma E] F is also a (0, /)— factor. According to the 
definitions of p, q, we have 

dG-F{x) = dG{x) - dpix) > dG{x) - q{x) > dG{x) - dG{x) = 0 , 
dG-F{x) = dG{x) - dF{x) < dG{x) - p{x) . . 

<dG{x)-{dG{x)-{{m-l)f{x)-{m-l) + l)) 

= (m — l)f{x) — (m — 1 ) + 1 . 

Therefore, G — F is a (0, (m — l)/(a;) — (m — 1) + 1)— graph. Let iJ' = iJi \ 
{uiVi}. Then H-, 1 < i < k are vertex disjoint (m — 1)— subgraphs of G — 
F. According to induction hypothesis, G — F has a (0, /)— factorization T' = 
{Fi, F 2 , ..., Fm-i} orthogonal to F 2 , ..., E['^. It is obvious that T = T' Vd {F} 
is a (0, /)— factorization of G that is orthogonal to Fi, F 2 , ..., E[]~. 

Thus completes the proof of Theorem E □ 

Corollary 2. Let G be a (0,mf — m+1)— graph, where f > k + 2 is an integer- 
valued function defined on V. Then for any km— matching M of G, G has a 
{0, f)— factorization T = {Fi, F 2 , ..., Fm} satisfying that each Fi,l < i < m 
shares exactly k edges with M . 
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Remark 1. Let m, k and I be integers such that k < m — 1, I < m. Let G be a 
(0,m/ — fc)— graph, where / is an integer- valued function defined on V satisfying 
f > 1. Let Hi,H 2 , Hi be m— subgraphs of G. Then G has a subgraph R such 
that R has a (0, /)— factorization orthogonal to Hi, H 2 , Hi. 

Proof. Without lose of generality, we suppose E{Hi) = {e^j, 1 < 

m 

i < l. Let Fj = {e^^. ,1 < i < 1},1 < j < m. Let R = IJ T}- Then T = 

1=1 

{Fi, F 2 , ..., Fjn} is a (0, /)— factorization of R that is orthogonal to Hi,H 2 , ..., Hi. 

□ 

Remarks and Discussion Anstee 0 gave a polynomial-time algorithm which ei- 
ther finds a {g, /)— factor or shows that one does not exist in 0 (|yp) operations. 
Hell and Kirkpatrick presented an 0{y/{g{V))\F\) algorithm for the general 
{g, /)— factor problem. If g{x) fy f{x) for every x G K, it was shown that there is 
a ( 5 ,/)— factor algorithm of time complexity 0{g{V)\F\) by finding alternating 
paths in |3]. The value p{x),q{x) and the set Fi can easily be found by 0{\V\) 
operations. Let Gi = G — F 2 . Set p'{x) = p{x) — 1 and q'{x) = q{x) — 1 when 
x = Ui,Vi,l < i < k, otherwise, set p'(x) = p{x),q'{x) = q{x). Then we can 
find a (p', q')— factor F in G\ by the algorithm in [Q or [^. It is easy to see that 
F VJ Fi is a (0, /)— factor of G containing Fi and excluding F 2 . It follows from 
the proof of Theorem 2 that G — F is a (0, (m — 1)/ — {m — 1) + 1)— graph. 
Repeating the above procedure, we can find after at most m — 1 operations a 
(0, /)— factorization T = {Fi, F 2 , ..., F^} orthogonal to mutually vertex-disjoint 
subgraphs Hi, H 2 , ■■., Hk in G. 

Our work on the factorization orthogonal to edge disjoint subgraphs is the 
first result in this direction. It would be interesting to know the tight conditions 
for this problem. For vertex disjoint subgraphs, we completely relax the restric- 
tion of the previous work on the lower bound g (to the all zero function) and 
conclude the study in this direction. 

Acknowledgments. The author is grateful to Dr. Xiaotie Deng and Professor 
Guizhen Liu for their helpful comments and advice on this paper. 
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Abstract. In this paper, we deal with the notion of star coloring of 
graphs. A star coloring of an undirected graph G is a proper vertex col- 
oring of G (i.e., no two neighbors are assigned the same color) such that 
any path of length 3 in G is not bicolored. 

We give the exact value of the star chromatic number of different fami- 
lies of graphs such as trees, cycles, complete bipartite graphs, outerplanar 
graphs and 2-dimensional grids. We also study and give bounds for the 
star chromatic number of other families of graphs, such as hypercubes, 
tori, d-dimensional grids, graphs with bounded treewidth and planar 
graphs. 

Keywords : graphs, vertex coloring, proper coloring, star coloring, 
acyclic coloring, treewidth. 



1 Introduction 

All graphs considered are undirected. In the following definitions (and in the 
whole paper), the term coloring will be used to define vertex coloring of graphs. 
A proper coloring of a graph G is a labeling of the vertices of G such that no two 
neighbors in G are assigned the same label. Usually, the labeling (or coloring) of 
vertex x is denoted by c{x). In the following, all the colorings that we will define 
and use are proper colorings. 

Definition 1 (Star coloring). A star coloring of a graph G is a proper col- 
oring of G such that no path of length 3 in G is hicolored. 

We also introduce here the notion of acyclic coloring, that will be useful for 
our purpose. 

Definition 2 (Acyclic coloring). An acyclic coloring of a graph G is a 
proper coloring of G such that no cycle in G is bicolored. 



A. Brandstadt and V.B. Le (Eds.): WG 2001, LNCS 2204, pp. 140 41531 2001. 
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For any of the above colorings, we define by chromatic number of a graph 
G the minimum number of colors which are necessary to color G according to 
the definition of the given coloring. Depending on the coloring, the notation for 
the chromatic number of G differs ; it is denoted x(G) for the proper coloring, 
Xs(G) for the star coloring and a(G) for the acyclic coloring. 

By extension, the chromatic number of a family T of graphs is the minimum 
number of colors that are necessary to color any member of T according to the 
definition of the given coloring. Depending on the coloring, this will be denoted 
either by Xs(^) or a(T). 

The purpose of this paper is to determine and give properties on Xs(-?^) for 
a large number of families of graphs. In Section [2l we motivate the problem 
and present general properties for the star chromatic number of graphs. In the 
following sections (SectionOtoED, we determine precisely Xs{^) for trees, cycles, 
complete bipartite graphs and 2-dimensional grids and we give bounds on Xs{^) 
for other families of graphs, such as hypercubes, graphs with bounded treewidth, 
outerplanar graphs, d-dimensional grids, 2-dimensional tori. 



2 Generalities 

We note that for any graph G, a star coloring of G is also an acyclic coloring of 
G : indeed, a cycle in G can be bicolored if and only if it is of even length, that 
is of length greater than or equal to 4. However, by definition of a star coloring, 
no path of length 3 in G can be bicolored. Thus the following observation. 

Observation 1 For any graph G, a{G) < Xs(G). 

Actually we can remark that the star coloring is an acyclic coloring such that 
if we take two classes of colors then the induced subgraph is a bipartite graph 
composed only of stars. Star coloring was introduced in 1973 by Griinbaum [S]. 
He linked star coloring to acyclic coloring by showing that any planar graph has 
an acyclic chromatic number less than or equal to 9, and by suggesting that this 
implies that any planar graph has a star chromatic number less than or equal 
to 9 • 2® = 2304. 

However, this property can be generalized for any given graph G, as men- 
tioned in |3I2| . In | 3I2| . the result was just stated, but no proof was given. We 
detail the proof here for completeness. 

Theorem 1 (Relation acyclic/star coloring). For any graph G, if the acyclic 
chromatic number of G satisfies a{G) < k, then the star chromatic number of G 
satisfies Xs(G) < k ■ 2^“^. 

In order to prove the theorem, we shall use the following proposition. 

Proposition 1. Let T be a tree and Vi and V 2 be the bipartition of its set of 
vertices, then there exists a star coloring of T c : V{T) — )> {0,1, 2, 3} such that 
if V € Vi then c(v) € {0,2} and if v € V 2 then c(v) € {1,3}. 
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Proof. Let v be any vertex belonging to Vi, we give the following coloring c of 
the vertices of T : c(v) = 0 and for x G V(T) \ {u}, c(x) = d(v,x) mod 4. It is 
easy to see that this is a star coloring and that it has the required properties of 
the proposition. 

Now we prove the theorem. 

Proof. Let G = (V, E) be a graph with a{G) < k, and let Vi, V2, . . . , 14 be the 
color classes of an acyclic coloring of the vertices of G with k colors. The color 
classes form a partition of the set V and for any i ^ j belonging to {1, . . . , A:}, the 
subgraph induced by ViUVj, denoted by G\ViUVj\, is a forest. For any G[1^UV,] 
{i < j), we denote by Cij a star coloring of the vertices with 4 colors such that if 
V GVi then Cij{v) G {0, 2}, and ifv GVj then c{v) G {1, 3}. We define the follow- 
ing coloring c of the vertices of G : let m G V, then there is a unique z G {1, . . . , fc} 
such that V G Vi, let c(u) = (ciy(M), . . . , *, Ci,j+i(u), . . . , Ci,fc(u)). We 

have to notice that the terms before the star belong to {0, 2} and that the terms 
after the star belong to {1, 3}. By this definition we use at most k2^~^ colors. It 
is clear that this is a proper coloring, indeed ii xy G E then x and y belong to 
different sets Vi, and by definition c{x) yf c(y). Now we have to prove that there 
is no bicolored path of length 3. By contradiction let us assume that there is 
such a path P with V{P) = {u,v,w,t} and E{P) = {uv,vw,wt}. By definition 
of the coloring the bicolored path is between Vi and Vj for some i < j. W.l.o.g., 
assume that u,w G Vi and v,t G Vj ] if c{u) = c{w) and c{v) = c{t), then we 
have Cij(u) = Cij(w) and Cij{v) = Cij{t), which is impossible because there is 
no bicolored path in G\Vi U Vj] with the coloring aj. This completes the proof. 

In the following, let V denote the family of planar graphs. We deduce the 
following easy result. 

Theorem 2. 7 < Xsi'P) ^ 80. 

Proof. Griinbaum showed that any planar graph has an acyclic coloring using at 
most 9 colors, but conjectured that the exact answer was 5. Moreover, he gave 
an example of a planar graph G for which a(G) = 5. This conjecture was solved 
in 1979 by Borodin [S]. This result, combined with Theorem [1] gives the upper 
bound for the theorem. 

Moreover, there exists a planar graph G2 for which any star coloring needs 
7 colors. This graph is shown in Figure H] (right). Thanks to the computer, we 
know that Xs(G2) = 7. 

The girth 5 of a graph G is the length of its shortest cycle. In [3], it is proved 
that if G is planar with girth g > 5 (resp. g > 7), then a(G) < 4 (resp. a(G) < 3). 
Together with Theorem [T] we deduce : 

Corollary 1. If G is a planar graph with girth g > 5, then Xs(G) < 32. If G is 
a planar graph with girth g >7, then Xs{G) < 12. 

Several graphs are cartesian product of graphs (Grid, Hypercube, Tori), so it 
is interesting to have an upper bound for the star chromatic number of cartesian 
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Fig. 1. Planar graph : Xs{G 2 ) ~ 7 



product of graphs. We recall that the cartesian product of two graphs G = (V, E) 
and G' = {V , E'), denoted by GDG', is the graph such that the set of vertices 
is y X 17' and two vertices {x,x') and {y,y') are linked if and only ii x = y and 
x'y' is an edge of G' or x' = y' and xy is an edge of G. 

Theorem 3. For any two graphs G and H, Xs{GOE[) < Xs{G) ■ Xs{H). 

Proof. Suppose that Xs{G) = g and Xs{H) = h, and let cq (resp. ch) be a star 
coloring of G (resp. H) using g (resp. h) colors. In that case, we assign to any 
vertex (m, v) of GDiJ color [cg{u),Ch{v)]. This coloring uses g ■ h colors, and this 
defines a star coloring. Indeed, suppose that there exists a path P of length 3 
that is bicolored in GDEl, with V{P) = {x,y,z,t} and E{P) = {xy,yz, zt}. 
Depending on the composition of the ordered pairs corresponding to the vertices 
of the path, we have 8 possible paths. We will only consider 4 of them, because 
by permuting the first and second component of each ordered pairs, we obtain 
the others. The 4 possible paths are : 

(1) X = {u,v),y = {u, vi),z= (w, V 2 ),t= (m, us) 

(2) X = {u,v),y= {u,vi),z= {u,V 2 ),t= (ui,V 2 ) 

(3) X = {u,v),y= {u,vi),z= {ui,vi),t= (ui,V2) 

(4) X = {u,v),y = {u, vi),z= {u 2 ,vi),t = {us, v{) 

Clearly, in the first case P cannot be bicolored, since the path u, vi,V 2 , us is not 
bicolored in FI. For the second case : y and t have different colors {viV 2 is an 
edge of F[). For the third case : x and 2 : have different colors {vv\ is an edge of 
FI). The same argument works for the last case. 

Observation 2 For any graph G and for any 1 < a < |y(G)|, let Gi, . . ., Gp 
be the p connected components obtained by removing a vertices from G. In that 
case, Xs{G) < maxi{xs(Gi)} + a. 

Proof. Star color each Gi, and reconnect them by adding the a vertices previ- 
ously deleted, using a new color for each of the a vertices. Any path of length 
3 within a Gi will be star colored by construction, and if this path begins in Gi 
and ends in Gj with i j, then it contains at least one of the a vertices, which 
has a unique color. Thus the path of length 3 cannot be bicolored, and we get a 
star coloring of G. 
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Remark 1. For any a > 1, the above result is optimal for complete bipartite 
graphs Kn^m- W.l.o.g., suppose n < m and let a = n. Remove the a = n vertices 
of partition We then get m isolated vertices, which can be independently 
colored with a single color. Then, give a unique color to the a = n vertices. We 
then get a star coloring with n + 1 colors ; this coloring can be shown to be 
optimal by Theorem 



Observation 3 For any graph G that can be partitioned into p stables Si, . . 
Sp, XsiG) < l + |R(G)|-max,{|5,|}. 

Proof. Let 1 < j < p such that Sj is of maximum cardinality. Color each vertex 
of Sj with a single color c, and give new pairwise distinct different colors to all 
the others vertices. This coloring holds the desired number of colors. It is clearly 
a proper coloring, and it is also a star coloring, because there is only one color 
which is used at least twice. 

Remark 2. The above result is optimal for complete p-partite graphs R^si,s 2 ,. .,sp- 

3 Trees, Cycles, Complete Bipartite Graphs, Hypercubes 



Theorem 4. Let be the family of forests such that d is the maximum depth 
over all the trees contained in Td- In that case, Xs{^d) = min{3, d+ 1}. 

Proof. Let F be a forest contained in tFd- When d = 0, the result is trivial {F 
holds no edge). When d = 1, we color each root of each tree contained in F with 
color 1, and all the remaining vertices by color 2. This is obviously a proper 
coloring, and since in that case there is no path of length 3, it is consequently 
a star coloring as well. Now we assume d > 2. We then color each vertex v, of 
depth dy in F, as follows : c{v) = dy mod 3. Clearly, this is a proper coloring of 
F and it is easy to see that it is a star coloring. 



Theorem 5. Let Cn be the cycle with n > 5 vertices. 

, „ / 4 when n = 5 

Xs(, n) yti otherwise 

Proof. It can be easily checked that XsiC^) = 4. Now let us assume n yf 5. 
Clearly, 3 colors at least are needed to star color G„. We now distinguish 3 cases : 
first, if n = 3fc, we color alternatively the vertices around the cycle by colors 1,2 
and 3. Thus, for any vertex u, its two neighbors are assigned distinct colors, 
and consequently this is a valid star coloring. Hence Xs(C 3 fc) < 3. Suppose now 
n = 3/c + l. In that case, let us color 3k vertices of Cn consecutively, by repeating 
the sequence of colors 1,2 and 3. There remains 1 uncolored vertex, to which we 
assign color 2. One can check easily that this is also a valid star coloring, and 
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thus Xs(C'3fc+i) < 3. Finally, let n = 3fc + 2. Since the case n = 5 is excluded 
here, we can assume k >2. Thus n = 3(fc — 1) + 5, with fc — 1 > 1. In that case, 
let us color 3{k — 1) consecutive vertices along the cycle, alternating colors 1,2 
and 3. For the 5 remaining vertices, we give the following coloring : 2, 1,2, 3, 2. 
It can be checked that this is a valid star coloring, and thus Xs{C 3 k+ 2 ) < 3 for 
any k >2. Globally, we have Xs(C'„) = 3 for any n 5, and the result is proved. 

Theorem 6. Let Kn,m be the complete bipartite graph with n+m vertices. Then 
Xs{Kn,m) = min{m, n} + 1. 

Proof. W.l.o.g., let n < m. The upper bound of n + 1 immediately derives from 
Observations (cf. Remark [I] for a detailed proof). 

Now let us prove that Xs{Kn,m) > n + 1. For this, let us show that any 
coloring with n colors will give us at least one bicolored cycle of length 4. Let 
Sn (resp. Sm) be the set of colors used to color the vertices of Vn (resp. Vm). 
Clearly, since all possible edges exist between vertices of Vn and vertices of Vm, 
we must have Sn H Sm = 0 in order to achieve a proper coloring. Suppose then 
that we use n — x colors for the vertices of Vn, and x colors for the vertices of 
Vm, with 1 < a; < n — 1. In that case, there exists at least 2 vertices a„ and 
in Vn (resp. am and hm in Vm) that are given the same color Ci (resp. C 2 ). By 
definition of Kn,m, there exists a cycle of length 4 going through the vertices a„, 
bm, bn, am, and this cycle is bicolored with colors Ci and C 2 . Thus, no coloring 
that uses n colors can be a star coloring, and Xs{Kn,m) > n + 1. 

For our purpose we introduce a new definition here : the k distance coloring of 
graphs. 

Definition 3 (fc distance coloring). A k distance coloring of a graph G 
is a coloring of G such that any two vertices at distance at most k are assigned 
different colors. 

Remark 3. For any k > 1, k distance coloring is a generalization of proper 
coloring, since the latter can be seen as a, k distance coloring where k = 1. 

We define, as in Section [I] the k distance chromatic number of a graph G as 
being the minimum number of colors necessary to get a k distance coloring of 
G. It is noted Xfe(G). 

In this section, we will only focus on the case k = 2. Indeed, it is easy to 
see that for any graph G, a 2 distance coloring of G is also a star coloring of G. 
Hence the following observation. 

Observation 4 For any graph G and for any k (k > 2), Xs(G) < Xfc(G). 

2 distance coloring (and, more generally, k distance coloring) of hypercubes 
has already been studied [T2l9]. The main result concerning 2 distance coloring 
is the following. 

Theorem 7. WA Let Hn denote the hypercube of dimension n. For any n > 1, 
« + 1 < X2{Hn) < 2ri°S2("+Bl . 
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Wan conjectured that the above upper bound is tight. Note that Theorem 0 
above implies that x^iHn) lies between n + 1 and 2n, depending on the value of 
n ; this also shows that there is an infinite number of cases for which the result 
is optimal : indeed, when n = 2^ — 1 the lower and upper bounds coincide, and 
in that case X 2 {H 2 k_i) = 2^ = n + 1. 

Thanks to this theorem, and thanks to Observation [d] we get the following 
property. 

Property 1. For any n, Xs{Hn) < 2 l*°S 2 ("+i)l ^ 

However, as can be seen in Table [T] this upper bound is not tight. All results 
given in bold characters in Table[T]are optimal ; the ones concerning star coloring 
are discussed below. 

Table 1. Star and 2 distance coloring of Hn 



n 


Xs{H„) 


X2{Hr.) 


n 




X2{Hr.) 


2 


3 


4 


5 


6 


8 


3 


4 


4 


6 


6 


8 


4 


5 


8 


7 


6 < < 8 


8 



Property 2. XsiH^) = 3, XsiH^) = 4 and X-s{Hi) = 5. 

Proof. The first result is trivial. It can also be easily seen that Xs{H^) > 4. Since 
is a subgraph of i? 4 , we deduce that Xs{Ha) > 4. 

The upper bounds are given, in each case, by a star coloring with the appro- 
priate number of colors. These are given in Figure l2l 




Fig. 2. Star coloring of H 2 , H 3 and H 4 



Property 3. Xs{H^) = 6, X.s{H^) = 6 and x.s{Hr) < 8. 
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Proof. The lower bound of 6 for XsiH^) and Xs{Hq) is given by computer : there 
is no possible star coloring of those two graphs with 5 colors. 

The upper bounds are given in each of the 3 cases by an appropriate star 
coloring using the required number of colors. Those colorings are detailed in the 
appendix of [7], where the result is given as a list of correspondences between 
each vertex and its color. 

4 d-Dimensional Grids 

In this section, we study the star chromatic number of grids. More precisely, we 
give the star chromatic number of 2 dimensional grids, and we extend this result 
in order to get bounds on the star chromatic number of grids of dimension d. 

We recall that the grid G{n, m) is the cartesian product of two paths of length 
n — 1 and m — 1. Due to symmetries, we will always consider in the following 
that m > n. A summary of the results is given in Table E] ; those results are 
detailed below. 



Table 2. Star coloring of 2-dimensional grids G(n, m) (n < m) 





m = 2 


m = 3 


m > 4 


n = 2 


3 


4 


4 


n = 3 


XXX 


4 


4 


n > 4 


XXX 


XXX 


5 



Proposition 2. Xs(G(2, 2)) = 3, and for any m > A, Xs{G{2,m)) = Xs{G{3,m)) 
= 4. 

Proof. The first result is trivial. 

It is easily checked that Xs{G{2,m)) > 4 for any m > 3, since star coloring 
of G(2,3) requires at least 4 colors, and since G(2, 3) is subgraph of G(2,m) 
for any m > 3. Moreover, as can be seen in Figure El (left), it is possible to 
find a 4 star coloring of G{2,m). Thus Xs(G(2,m)) < 4, and altogether we have 
Xs(G(2,to)) = 4. 

The fact that Xs(G(3, m)) > 4 for any m > 3 is trivial, since G(2,3) is 
a subgraph of G(3,m). Moreover, Figure E] (right) shows a 4 star coloring of 
G(3 ,to)) for any m > 3. 

The main theorem of this section is the following. 

Theorem 8. For any n and m such that min{n,77i} > 4, Xs(G(n,m)) = 5. 

Proof. By a rather tedious case by case analysis (confirmed by the computer), 
it is possible to show that 5 colors at least are needed to color G(4,4). Hence, 
for any n and m such that min{n,m} > 4, Xs{G{n,m)) > 5. 
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Fig. 3. 4 star colorings of G(2, m) (left) and G(3, m) (right) 



Now let us show that 5 colors are sufficient to get a star coloring of G(n, m) . 
Our aim is to have colors such that no two vertices at distance 2 are assigned 
the same color (except for the color 1): for this, we will identify any vertex Vi of 
G(n, m) by its coordinates {xi, yi), Q < Xi <n — 1 and 0 < < m — 1. We now 

describe the coloring scheme : 

— Any vertex v = {x, y) with x + y = Q mod 2 is assigned color 1 ; 

— Any vertex v = (4p, 4y + 1) and v = (4p + 2, 4q + 3) is assigned color 2 ; 

— Any vertex v = (4p, Aq + 3) and v = (4p + 2, 4q + 1) is assigned color 3 ; 

— Any vertex v = (4p + 1, Aq) and v = {Ap + 3, 4q + 2) is assigned color 4 ; 

— Any vertex v = (4p + 1, 4q + 2) and v = {Ap + 3, Aq) is assigned color 5. 

An example of this coloring is shown in Figure with n = 5 and m = 9. 

It can be easily checked that the above coloring assigns a color to each vertex, 
and that this coloring is proper ; moreover, this is a valid 5 star coloring of 
G(n, m), since two vertices u and v at distance 2 are assigned the same color 
iff c{u) = c(v) = 1. In that case, there cannot exist a path of length 3 that is 
bicolored. Thus Xs{G{n,m)) < 5, and the result is proved. 




We know that d-dimensional grids Gd are isomorphic to the cartesian product 
of d paths. Hence, by Theorems El and E] we get an upper bound for Xs{Gd) ■ 
Xs{Gd) < for any d > 1. Due to the result of Theorem Elabove, we can slightly 
improve this bound to 5 • for any d > 2. However, it is even possible to 

do better : for this, we generalize the star coloring of paths and 2-dimensional 
grids. This is the purpose of the following theorem. 

Theorem 9. Let Gd be any d- dimensional grid, d > 1. Then Xs{Gd) < 2^^ -|- 1. 
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Proof. The idea here is to design a star coloring for Gd that generalizes the 
one given for d = 2 in Proof of Theorem More precisely, we want to define a 
coloring of Gd such that, for any fixed d—2 dimensions, the induced 2 -dimensional 
grid has a coloring similar to the one of Figure[4[ Let us now detail the coloring in 
itself : first, Gd being bipartite, we color every vertex of one of the two partitions 
of Gd by color 1. More formally, if every vertex v G Gd is defined by its d 
coordinates, that is = (xi, X 2 ■ ■ ■ Xd), then c{v) = 1 V Xi,X 2 ■ ■ -Xd s.t. X\ P 
X 2 + ■ ■ ■ + Xd = Q mod 2. Now, we need to assign the 2'^ remaining colors to the 
remaining vertices. Again, we will do it depending on the coordinates X\,X 2 ■ ■ - Xd 
of the considered vertex v. For this, we will use a set Sd of 2^^ vertices, chosen 
thanks to their coordinates, and for which every vertex of Sd will be assigned a 
unique color 2 < c < 2^^ -|- 1. Starting from the vertices of Sd we will then apply 
a rule to color the still uncolored vertices. First, let us detail the construction 
of Sd- Let Vo be the set of vertices of Gd holding an even number (resp. odd 
number) of coordinates equal to 0 if d is odd (resp. if d is even). Let now Vb,i 
be the subset of Vq such that all the non-zero coordinates are all equal to 1. It 
is not difficult to see that |Vb.i| = 

Now let Vb,i ,3 be the subset of Vq such that all but one of the non-zero 
coordinates are equal to 1, the last one being equal to 3. Finally, let VJ) ^ 3 be 
the maximum subset of Vo,i ,3 such that any two vertices u and v in Vq 3 3 have 
at least one zero-coordinate location which differs. In that case, it is clear that 
|Pq 1 3 I = 2‘^~^. We now define Sd as follows : Sd = Vb,i U Iq' 1 3 . 

Example : in the case d = 4, Vq 1 = {(1,0, 0,0), (0,i,0, 0), (0,0, 1,0), 
(0, 0, 0, 1), (1, 1, 1,0), (1, 1,0, 1), (1,0, 1,1), (0,1, 1,1)}, while Ki 3 = 1(3,0, 0,0), 
(0,3,0,0), (0,0,3, 0), (0,0,0, 3), (3, 1, 1, 0), (3, 1,0,1), (3, 0,1,1), (0,3,1,!)}. 

We assign to each vertex of Sd a unique color 2 < c < 2^-|- 1. Now, let us take 
a vertex u G Sd with color c{u) ; we then assign c(u) to any vertex u' obtained 
from u by any combination of the two following rules : 



— (Rule 1) : add ±4 to exactly one of the coordinates ; 

— (Rule 2) : add ±2 to exactly 2 of the coordinates. 



In that case, it can be seen that all the vertices of the d-dimensional grid 
have been assigned a color : indeed, every vertex v such that the sum of its 
coordinates is even is assigned color 1 ; moreover, for every other vertex, Sd 
constitues a basis of colored vertices, from which, thanks to Rules 1 and 2, every 
vertex in Gd is assigned a color. 

This coloring is clearly a proper coloring of Gd (no two neighbors have been 
assigned the same color). Moreover, this is also a valid star coloring, since every 
vertex u s.t. c(u) yf 1 is such that any vertex v with color c{v) = c(u) is at least 
at distance 4 from u (by Rules 1 and 2). The above-mentioned coloring using 
2^^ -I- 1 colors, we conclude that Xs{Gd) < 2^^ -|- 1. 



Remark 4- We note that for dimensions 1 and 2, the upper bound given by 
Theorem for d-dimensional grids is tight (cf. Theorem when d = 1 and 
Theorem |8] when d—2). 
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5 2-Dimensional Tori 



In the following, for any n,m > 3, we denote the toroidal 2-dimensional grid 
by TG{n,m). We recall that TG{n,m) is the cartesian product of two cycles of 
length n and m. Our main result here is the following. 

Theorem 10. For any 3 < n < m, 5 < Xs{TG{n,m)) < 6. 

Proof. The proof is given in [7] . More precisely, we show that there is an infinite 
number of cases of TG{n,m), for which Xs{TG{n,m)) = 5. The fact that 5 
colors are always needed, and that 6 colors always suffice for the star coloring of 
TG{n,m), n,m>3 derives from several propositions (see 0)- 

In some cases, we have been unable to determine precisely the number of 
colors necessary to star color TG{n,m), n,m>3 (though we know it up to an 
additive constant of 1). However, we pose the following conjecture. 

Conjecture 1. For any n > m > 3 



Xs{TG{n,m)) 



6 when n = m = 3 or when n = 3 and m = 5 
5 otherwise 



6 Graphs with Bounded Treewidth 

The notion of treewidth was introduced by Robertson and Seymour m- A tree 
decomposition of a graph G = (V,E) is a pair {{Xi\i G I},T = (I,F)) where 
{Xi\i £ /} is a family of subsets of V , one for each node of T, and T a tree such 
that : 

( 1 ) U.^iXi = v 

(2) For all edges vw G E, there exists an. i G I with v G X^ and w G Xi 

(3) For all i,j,kGl : if j is on the path from z to fc in T, then X^ fl X^ C Xj 

The width of a tree decomposition {{Xi\i £ /}, T = (/, E)) is max^g/ \Xi\ — 1. 
The treewidth of a graph G is the minimum width over all possible tree decom- 
position of G. 

We will prove the following theorem. 

Theorem 11. If a graph G has a treewidth at most k, then Xs{G) < k(k + 
3)/2+l 

Actually we will prove Theorem|TT]for fc-trees, because it is well known that 
the treewidth of a graph G is at most k {k > 0) if and only if G is a partial 
fc-tree [Sj- 

We recall the definition of a fc-tree |1] : 

(1) a clique with fc- vertices is a /c-tree 

(2) If T = (V, E) is a fc-tree and G is a clique of T with k vertices and x , 
then T' = {V LI {x}, E U {cx : c G G}) is a fc-tree. 

If a fc-tree holds exactly fc vertices, then it is a clique by definition. If not, 
it contains at least a fc -I- 1 clique ; moreover, it is easy to see that the greedy 
coloring with fc -|- 1 colors of a fc-tree gives an acyclic coloring. Hence we can 
deduce the following observation. 
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Observation 5 For any k > 1 and any k-tree Tk : 

- Xs{Tk) = ktf\V{Tk)\ = k; 

— k + 1 < Xs{Tk) < (fc + 1) • 2 ^ otherwise. 

Theorem II II is in fact a corollary of the following result, which gives a much 
tighter bound than the one of Observation 

Theorem 12. fc + 1 < Xs(Tfc) < fc(fc + 3)/2 + 1 

Proof. Consider a A:-tree G. We recall that a fc-tree G is an intersection graph 
PH and can be represented by a tree T and a subtree S'„ for each v in G s.t. : 

(1) uv G E{G) ^ 

(2) for any t G T, |{u : t G S'„}| = fc + 1 

We can see that by considering the tree decomposition of the fc-tree. The 
tree T is the one of the tree decomposition and the subtree Sy for v G V (G) is 
exactly the subtree of T containing the nodes of T corresponding to the subsets 
of the tree decomposition containing v. 

We root T at some node r and for each vertex u of G let t{v) bet the first node 
of Sy obtained when traversing T in pre-order (i.e. t{v) is the “highest” node of 
Sy ) . We choose some fixed preorder and order the nodes as vi,V2, ■ . . ,Vn so that 
for i < j, t{vi) is considered in the preorder before t{vj). We fc(fc -I- 3)/2 -|- 1 color 
the nodes in this order. 

For each i we let : 



Svj^t{vi) 

We show now that < k(k -I- 3)/2 and for any vj G Xy., j < i. Indeed 
let A = {oi, 02 , • • • , Ofc, Ui} be the subset of vertices corresponding to t{vi). We 
assume that t{ai) is before t{a *+i) (* e {1, . . . , k — 1}) in the preorder. We first 
give an upper bound for the number of subtrees which contain t{ai) {vi ^ A). 
The number of Sy^ (vi ^ A) for i G {1, • • • , fc} which contain t{ai) is at most k, 
because the corresponding subset can intersect A only in a\. The number of Sy,^ 
(yi A) which contain t{a2) is at most fc — 1, a\ is in the subset corresponding to 
t{a2) and we do not count Sa^ . It is easy to see that the number of {vi ^ A) 
which contain t{ai) is at most fc-|-I — *, because the subset corresponding to t{ai) 
contains {oi, 02 , • • • , at}. In total we have at most + 1) /2 subtrees 

Syj with vi ^ A containing t{ai) i G {1, • • • , k}. Now we have to add the number 
of Sai, this gives \Xy.\ < k{k + 3)/2, without counting Vi. We color Vi with any 
color not yet used on Xy.. This clearly yields a proper coloring, indeed if xy 
is an edge of G then H Sy ^ ^ and either t{x) G Sy or t{y) G S^, hence by 
construction x and y have different colors. We claim it also yields a coloring with 
no dichromatic P 4 (a path of length 3) : assume the contrary, and let {x,y, z,w} 
be this P4 labelled so that 
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(1) t{x) is the first of t{x),t{y),t{z),t{w) considered in the preorder, 

(2) X and y have the same color, 

(3) xz and yz are in E{G). 

We have to notice that x,z,y are not in the same clique ATfc+i of the graph 
G corresponding to a node of T, because by construction this would imply that 
the colors are different. Now Sy = 0 , Szr\ Sx ^ 0 , Sz C\ Sy 0 , so by (1) 
t{y) is in Sz- Further since zx G E{G), by (1) we have t{z) is in Sx- So x G Xy, 
contradicting the fact that x and y get the same color. 




Fig. 5. A 2-tree H that satisfies Xb{H) — 6 



We can notice that for 1-trees (i.e. the usual trees), the upper bound we 
obtain matches the one given by Theorem For 2-trees, the upper bound is 
optimal because of the graph in Figure which has been shown by computer 
to have a star chromatic number equal to 6. 

In the following, we will denote by O the family of outerplanar graphs. 

Corollary 2. Xs{0) = 6. 

Proof. It is well-known that any outerplanar graph is a partial 2-tree, then 6 
colors suffice ; moreover the graph in Figure El which is also outerplanar, needs 

6 colors. 

7 Conclusion 

In this paper, we have provided many new results concerning the star chromatic 
number of different families of graphs. In particular, we have provided exact re- 
sults for paths and trees, cycles, complete bipartite graphs, 2-dimensional grids 
and outerplanar graphs. We have also determined bounds for the chromatic num- 
ber in several other families of graphs, such as tori, d-dimensional grids, graphs 
with bounded treewidth, planar graphs and hypercubes. We have also deter- 
mined several more general properties concerning the star chromatic number. 

Using the techniques of [T], we can show that the star chromatic number of 
a graph of maximum degree A is 0{A^) and that for every A, there exists a 
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graph of maximum degree A whose star chromatic number exceeds e. for 
some positive absolute constant e. Details will appear in the journal version of 
the paper. 

A large number of problems remain open here, such as getting optimal results 
for other families of graphs, or refining our non optimal bounds ; getting one or 
several methods to provide good lower bounds for the star chromatic number is 
also another challenging problem. 
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Abstract. In a graph coloring, each color class induces a disjoint union 
of isolated vertices. A graph subcoloring generalizes this concept, since 
here each color class Induces a disjoint union of complete graphs. Erdos 
and independently Albertson et al. proved that every graph of maximum 
degree at most 3 has a 2-subcoloring. We point out in this paper that this 
fact is best possible with respect to degree-constraints by showing that 
the problem of recognizing 2-subcolorable graphs with maximum degree 
4 is NP-complete, even when restricted to triangle-free planar graphs. 
Moreover, in general, for fixed k, recognizing fc-subcolorable graphs is 
NP-complete on graphs with maximum degree at most k^. In contrast, 
we show that, for arbitrary k, fc-SuBCOLORABiLlTY can be computed 
efficiently on graphs of bounded treewidth and on cographs. 



1 Introduction and Results 

A k-coloring of a graph G is a partition of the vertices into k pairwise disjoint 
sets Vi, . . . , 14 such that for every f = 1, 2, . . . , fc, each color class V consists of 
isolated vertices, i.e. it forms a stable set. The smallest k for which the graph G 
has a fc-coloring is called the chromatic number of G, denoted by %(G). Graph 
coloring is a well studied topic, both for its theoretic and algorithmic aspects. 
It is well-known that testing 3-Colorability is NP-complete for triangle-free 
graphs with maximum degree 4 f|14p and for planar graphs with maximum de- 
gree 4 ([T^). Testing 3-Colorability is easy for graphs with maximum degree 
3 (Brooks theorem) and for triangle- free planar graphs (Grotzsch theorem). 

Graph coloring has been generalized in several ways and by a number of 
authors; see [2] for a comprehensive survey. In this paper we address one of 
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these generalized colorings. A partition V\, ... ,Vk of vertex set of a graph G is 
called a k-subcoloring of G if each color class Vi induces in G a disjoint union 
of complete subgraphs (of various sizes). The subchromatic number Xs(G) of 
G is the smallest integer k for which G has a /c-subcoloring. Subcolorings have 
been discussed in 1214151151 . It turns out that subcolorings have many interesting 
properties similar to colorings, and every fc-coloring is also a /c-subcoloring, hence 
x{G) > Xs{G). Among other results we would like to mention the following 
properties of graph subcolorings: 

(1) For every fc > 1, there is a triangle-free graph Gk with Xs{Gk) = k [mi]. 

(2) For every planar graph G, Xs(G) < 4. In addition, if G is outerplanar, 
Xs(G) < 3. These bounds are tight [Ij. 

(3) For every graph G with maximum degree A, Xs(G) < [yj -h 1 [2|. 

By (3), every graph with maximum degree at most 3 is 2-subcolorable. Ac- 
tually, this fact follows also from a theorem due to Erdos m saying that every 
graph G has a bipartite spanning subgraph H such that the degree in H of every 
vertex is at least one half of its degree in G. Thus, if the maximum degree in 
G is at most 3, every bipartition of H defines a sub-bipartition of G. Moreover, 
such a bipartite spanning subgraph H of G can be found in polynomial time, 
easily by a “local improvement” technique. 

Albertson et al. (i) then pointed out the difficulties involved in character- 
izing 2-subcolorable graphs by giving a number of examples. In this paper we 
prove the following theorems ( Sect . I2TT1 and [2?^ : 

Theorem 1. Recognizing 2-subcolorable graphs of maximum degree 4 is NP- 
complete, even on triangle-free planar graphs. 

Albertson informed us that, independently, Gimbel also proved Theorem [I] 
with a completely different reduction 

Notice that the NP-completeness of /c-Subcolorability for the class of all 
graphs follows from a theorem by Achlioptas [I]. We prove in Sect. I2.3l that: 

Theorem 2. For every fixed k > 2, /c-Subcolorability is NP-complete for 
graphs of maximum degree at most kf . 

For fixed k, /c-Subcolorability can be expressed as a monadic second order 
logic formula, and hence can be tested in linear time for graphs with bounded 
treewidth and cographs. Due to the fact that for these graphs Xs(G) < c for a 
constant c, we get that there exists an algorithm that in linear time determine 
Xs(G) for graphs with bounded treewidth and cographs. On the other side, the 
general algorithm is unnecessarily complicated for our purpose and we present 
two simple algorithms solving this problem, (see Sect. |3]and|4|) 

Theorem 3. For every k, fc-SuBCOLORABiLiTY can be efficiently decided on 
graphs of bounded treewidth, and on cographs. 

Various proofs are omitted in this paper due to space restrictions. In the full 
version of this paper we present a general algorithm for graphs with bounded 
cliquewidth and also NP-completeness of 3-Subcolorability of planar graphs. 
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2 The NP-Completeness of the Subcoloring Problem 

We prove Theorem [T]by reducing the Not-all-equal 3-satisfiability problem, which 
was proven NP-complete by Schaefer [T7] (see also m Problem L03]). 
Problem: LetC be a Boolean formula consisting of of m clauses such that every 
clause has exactly three distinct literals. The the decision problem whether there 
exists a satisfying assignment for C such that each clause in C has at least one 
false (and at least one true) literal is NP-complete. 

We denote the class of all formulas that allow such assignment by NAE3SAT. 



2.1 Triangle- Free Graphs of Maximum Degree 4 

We first prove in this section the NP-completeness of 2-Subcolorability for 
non-planar triangle-free graphs of maximum degree 4 (the problem is clearly 
in NP). Let C = {Ci, C 2 , . . • , Cm} be a Boolean formula in normal form of m 
clauses with variable set V = {vi,V 2 , ■ • ■ , u„} such that every clause Cj of C 
contains exactly three literals, Cj = {c^i \ C 2 \ c^^}- We will construct a triangle- 
free graph C = C{C) of maximum degree 4 such that C has a 2-subcoloring if 
and only if C G NAE3SAT. 

Before describing the construction, we consider the graph F shown in Figure [T] 
(left); its symbolic representation will be used when larger portions of the graph 
are drawn. 




Fig. 1. The graph F (left) and its symbolic representation (right) 



Lemma 1. The graph F is 2-subcolorable and in every 2-subcoloring of F, the 
labeled vertices u and v belong to different color classes. 

Now, for each clause Cj = {c^i\c2\^3^} consider the graph G{Cj) shown 
in Figure |2] 




Fig. 2. The graph G{Cj) 
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The following Lemma indicates the meaning of G(Cj). 

Lemma 2. In every 2-subcoloring ofG{Cj), not all of Cji,Cj 2 -,Cj^ belong to the 
same color class and each Cjk has a neighbor in G{Cj) that has the same color 
as Cjk- Moreover, every bipartition of {cji,Cj 2 ,cj 3 } into two nonempty sets can 
be extended to a 2-subcoloring of G{Gj). 

For each variable Vi G V let G{vi) be the graph shown in Figure E] 




Vii t>il Vi2 «i2 "im Vim 



Fig. 3. The graph G{vi) 



Lemma 3. The graph G(vi) is 2-subcolorable. Any its 2-subcoloring contains 
vertices vn,Vi 2 , ■ - - ,Vim in the same color class while vertices FIT, FI 2 , • ■ • , Flm 
are colored by the other color. 

We complete the construction of the graph G by taking m disjoint copies of 
G{Gj) (one copy for each clause Gj : 1 < j < n) by n disjoint copies of G{vi) 
(each copy correspond to one variable Vi) and we connect these graphs by adding 
additional edges connecting variables to the corresponding clauses {(cjk,Vij) : 
^ = Vi} and also the negated variables to clauses {{cik,Vij) ■ c^k'^ = Ff}. 
Observe that graphs G{vi) and G{Gj) are triangle free and the additional 
edges form a matching, hence G is triangle-free and moreover every its vertex is 
of degree at most 4. 

For each 1 < j < to and each 1 < fc < 3, the vertex cjk in G{Gj) has exactly 
one neighbor v outside G{Cj), more formally v G {vij,Vij} if Vi or vi is the literal 
(7) 

Cji in Gj. We call v the personal neighbor of Cjk- 

Lemma 4. In every 2-subcoloring of G, Cjk and its personal neighbor belong to 
different color classes. 

Proof. By Lemma |21 Cjk has a neighbor in G{Cj) that has the same color as 
Cjk- Therefore, by definition of subcoloring, the personal neighbor of Cjk has the 
other color. 

We now are going to show that C G NAE3SAT if and only if G has a 
2-subcoloring. 

Let 6 be a truth assignment for C in which every clause has at least one 
true and at least one false valued literal. We construct a 2-subcoloring of G as 
follows. If b{vi) = true then all Vij are red otherwise all Vij are blue, and extend 
to a 2-subcoloring of G{vi). This is possible as we have seen by Lemma EJ Next, 
color Cjk differently from its personal neighbor v G {vij,vij} (which is already 
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colored by the previous step). By assumption on the assignment b, not all of 
Cji,Cj 2 ,Cj 3 can have the same color and due to Lemma we may extend this 
coloring to a 2-subcoloring of G{Cj). By construction of G, the obtained coloring 
is a 2-subcoloring of G. 

Suppose that G has a 2-subcoloring in red and blue. We define the truth 
assignment b for C as follows: b{vi) — true if Vij is red for some j; otherwise 
b{vi) = false. By Lemma this assignment is well-defined. Due to Lemma 
for each j, two of Cji,Cj 2 ,Cj 3 have different colors. By Lemma the personal 
neighbors of these two vertices have different colors. Therefore, Gj has at least 
one true and at least on false literal by the truth assignment b. 

2.2 Planar Graphs of Maximum Degree 4 

In this section, we construct a triangle-free planar graph G' from the graph G 
obtained in the previous section, such that G is 2-subcolorable if and only if G' 
is 2-subcolorable. 

Call an edge in G a cv-edge if it connects a vertex Cjk with its personal 
neighbor v € {vij^Gj}- Note that G can be embedded in the plane, in polynomial 
time, such that every edge is a straight line and every edge crossing occurs only 
by two cu-edges. 

Recall that in any 2-subcoloring of G, the endvertices of every cu-edge belong 
to different color classes (see Lemma HI). This makes the use of the “crossover” 
technique in proving NP-completeness of Planar Graph 3-Colorability, 
described for example in m, possible. 



The “crossover” in our construction is the graph H shown in Figure |4] and 
has the following properties (consult Lemma [T|): 

(1) In any 2-subcoloring of H, x and x' belong to the same color class, and y 
and y' belong to the same color class. 

(2) H has a 2-subcoloring such that x and y belong to the same color class, and 
a 2-subcoloring such that x and y belong to different color classes. 




Fig. 4. The “crossover” graph H 
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The construction of the planar graph G' from G using crossover H is very 
similar to the construction for Planar Graph 3-Colorability. 

— To each cu-edge that is crossed by other cu-edge, add new vertices between 
each pair of adjacent crossings and one more vertex between Cjk and its 
nearest crossing. 

— Replace each crossing by a copy of H, identifying the outlets x and x' with 
the nearest two vertices on one of the cu-edge involved, and y and y' with 
the nearest two vertices on the other cu-edge. 

Observe, that the graph G' has a 2-subcoloring if and only if G does. 



2.3 The Hardness of fc-Subcoloring for fc > 3 

In this section we show that, for each fixed k > 2 the fc-SuBCOLORABiLiTY is a 
NP-complete problem on graphs with maximum degree at most k^. 

Lemma 5. If ip is a k-subcoloring of a graph G and H is an induced subgraph 
of G then the restrietion of ip on H is a k-subeoloring of H. 

Note that Lemma El does not hold for subgraphs in general. 

Lemma 6. For every k >2, the complete k-partite graph Kk^...^k,k+i consisting 
ofk—1 (small) partitions with k vertices and one (big) partition ofk+1 vertices 
has exactly one (up to permutation) k-subeoloring; this k-subcoloring is also its 
unique k-eoloring. The graph Kk,...^k,k+i cannot be subcolored with less than k 
eolors. 

We show a reduction from (A:— 1 )-Subcolorability to fc-SuBCOLORABiLiTY. 
Let G be the graph for which the existence of a fc-subcoloring is questioned and 
let V{G) = {ui, . . . ,Vn}- We define G' as follows: 

— Take n copies Hi, . . . ,Hn of the 

— in each Hi, label four distinct vertices of the big class with x\,xl,x\,x\, and 
label one vertex in each of the fc — 1 small classes with yj, 1 < j < k — 1; 

— add edges (a;j, (xl,yj~^) for all 1 < j < fc — 1, 2 < j < n; 

— add edges {xl,Vi), (x\,Vi) for all 1 < A < n. 

We claim that G is fc — 1-subcolorable if and only if G' is fc-subcolorable: 

(1) Assume G can be subcolored with fc — 1 colors. Then subcolor, in each 
Hi, the fc — 1 small classes with these fc — 1 colors (each class gets one color), 
and take one new color for the big class. This is a fc-subcoloring for G'. 

(2) Consider a fc-subcoloring of G'. Then, by LemmaEl the restriction of this 
subcoloring on each Hi is a fc-subcoloring. By Lemma E] each Hi gets all fc colors 
and each class in Hi is monochromatic. Moreover, the big classes of all HiS have 
the same color. 

We show that, for 1 < i < n, x\ and x^^^ have the same color. Assume not, 
then the color of x^-^^ must occur in a small class of Hi, say y\ has this color. 



160 



Jiff Fiala et al. 



But then yl induce a monochromatic path P 3 in G', contradicting the 

definition of fc-subcoloring. 

No vertex in G can have the color occurring in the big classes of the HiS. 
Therefore, the restriction of the fc-subcoloring of G" on G is a fc — 1-subcoloring 
of G. 

Lemma 7. For fc > 3, ^(G') = max{Z\(G) -I- 2, k^}: 

Proof. Observe that A{G') = max{ Z\(G) -1-2, da> (xl) , dc {yf)} ■ By the construc- 
tion, dcixi) = (fc— l)fc-|-(fc— 1) = k‘^ — 1 and also dciyl) = {k—2)k+{k+l)+2 = 
— fc -I- 3, hence, for fc > 3, the Lemma follows. 

Proof of Theorem The case k = 2 is proven by Theorem [1] The statement 
for fc > 3, follows from the construction and Lemma [3 and by noting that if G 
has maximum degree at most (fc — 1)^ then the graph G' constructed from G as 
above has maximum degree at most fc^. 

3 Graphs with Bounded Treewidth 

The notion of treewidth was introduced by Robertson and Seymour in m via 
tree decompositions. 

Let G = (Vj E) be a graph. The tree decomposition of G is a pair {T,X = 
{Xi : i G V{T)}), where T is a tree, each is a subset of V and the following 
is satisfied 

1. For each edge {u,v) G E exists i G V{T) such that u,v G Xi. 

2. For any v G V(G) the indices of sets Xi containing v induce a nonempty 
connected subtree of T. 

The width of a tree decomposition (T,X) is m.aKi^y(^T){\Xi\} — 1 and the 
treewidth of G is the minimum width among all possible tree decompositions. 
We denote treewidth by tw{G). If the treewidth of G is bounded by a constant 
c, a tree decomposition of width < c of G can be constructed in linear time ([31). 

For fixed fc, fc-SuBCOLORABiLiTY can be expressed in monadic second or- 
der logic, which is a language to describe graph properties, using the following 
constructions: quantifications over vertices, edges, sets of edges, sets of vertices, 
membership tests, adjacency tests and logic operations. By the results of Cour- 
celle (|H]) it is known that each problem that can be stated in monadic second 
order logic can be solved in linear time on graphs with bounded treewidth. Un- 
fortunately, the multiplicative constant grows very fast, essentially it is a tower of 
2 ’s whose height is the number of quantifier-alternations of the monadic second 
order logic formula. In our case the height is at least linear in fc. 

We present a decision algorithm that for fixed fc tests whether the subchro- 
matic number Xs(G) < fc for graphs G with bounded treewidth. The algorithm 
involves dynamic programming to find a proper fc-subcoloring. For simplicity we 
restrict ourself to nice tree decomposition. 
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The nice tree deeomposition of G ([H]) is a tree decomposition such that: T 
is a rooted binary tree and for each i G V (T) at least one of the following cases 
applies: 

— z is a leaf and \Xi\ = 1, then we call i a leaf node. 

— i has one son j and Xi = Xj U {n} for some v G V{G) \ Xj, then we call i 
an introduee node. 

— i has one son j and Xi = Xj \ {u} where v G Xj, then we call i a forget 
node. 

— i has two sons j,f, and Xi = Xj = Xj> then we call i a join node. 

Any tree decomposition can be transformed in linear time into a nice tree 
decomposition of the same width |13| . 

Denote by Gi the subgraph of G induced by vertices of Xi and G- the sub- 
graph of G induced by vertices of [jj^iXj, where j > i means that j is a 
descendant of i i.e. i lies on the path from j to the root of T. 

Let ^ be a fc-subcoloring of Gi, then the color clique of ft is any inclusion- 
maximal set K C V{Gi) = Xi, such that all vertices of K have the same color 
under (j) and are mutually adjacent in G. In other words, a color clique is any 
clique that belongs to a color class of 4>. 

We introduce a new data structure (Tabi)igv(T’) such that each entry Tab^ 
consists of several pairs {(f>, growi^j,), where each (f> is a, feasible fc-subcoloring 
of Gi, that might be extended to a subcoloring of G'i and growi^j, is a function 
assigning to each color clique K oicf a boolean variable which helps us to properly 
define a new A:-subcoloring in the inductive step. Note that a single (j) may occur 
in some entry of Tab^ several times with different functions growi^j,. However, as 
G has bounded tree width, the number of all pairs (</), growi^jf) is bounded by a 
constant. The evaluation of Tab^ goes as follows: 

1. If z is a leaf node then Tab^ contains all k possible A:-subcolorings (f of Gi = 
({u},0) and for the only color clique K = {u} and all (f set groWi^j,{K) = 

true. 

2. Let z be a forget node with the son j, and Tabj has been already com- 
puted. Then let Tab^ contain all entries from Tab^, restricted to set Xi. If 
K was a color clique of some fc-subcoloring {<f>' ,groWj^j,') G Tabj, and (j) 
is the restricted fc-subcoloring, then set groWi^fj{K \ {u}) = false only if 
K contains the vertex v = Xj \ Xi. For all other color cliques L of (f> let 
groWi^j,{L) = groWj^j,>{L). Remove duplicated entries in Tab^, if some exists. 

3. Let z be an introduce node with the son j, v G V(G) is the added vertex 
and Tabj is already known. Then for every pair ((f' , growj^j,') G Tabj and 
every fc-subcoloring (f of Gi, such that </> restricted onto Xj is equal to 4>' , 
find a color clique K of (p containing v. If K = {u} or groWj^j,i{K \ {u}) = 
true, then add into Tab^ entry ((p, growi^j,) where groWi^j){K) = true and set 
groWi^(j{L) = groWj^j,i{L) for all other color cliques L ^ K of p. 

4. Let z be a join node with sons j and f and p a fc-subcoloring of Xi. Then 
for all possible combinations of {p, growj^j,) G Tabj and {p, growj'^j,) G Tabj/ 
add the entry {p, growj^cf, A growj/^jf) into Tab^ if and only if for each color 
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clique K oi (j) the value of groWj^^(K) V growj' ^^{K) is true. Again, if some 
entries are present more times, store only one. 

5. Compute the values of Tab^ for all vertices i in the tree, as described in steps 
1-4. The graph G allows a A:-subcoloring if and only if the table entry Tab^. 
is nonempty for the root r. 

To see that the algorithm is correct we think that steps 2, 3 and 4 deserve 
further explanation. 

In step 2 we remember in the function grow the fact, that a certain color 
clique K has already lost a vertex v, and future extension of K by v' would 
cause that the color class will contain induced P3, since the edge {v,v') does not 
belong to G. Therefore in step 3 we try extend only those color cliques which 
might be extended. The same argument is used in step 4, since it is impossible 
to identify two color cliques when both of them already forget a vertex. Note 
that various functions growi^^ for a single /c-subcoloring 4> may appear during 
executing steps 2 and 4. 

This discussion concludes the proof of the first part of Theorem[Hl For a graph 
G with tree-decomposition of width c the decision of /c-Subcolorability can 
be performed as fast as the evaluation of the table (Tabi)igy(T’), that is in time 
0(n2'^fc'^+^). This expression is linear in n. 

Note that finding the minimum k such that G allows a fc-subcoloring can be 
done in time by running at most n tests for all k < n. 

4 Cographs 

Cographs belong to the class of graphs with bounded cliquewidth. Due to the 
results of Courcelle [H| problems that are expressible in monadic second order 
logic are linear time solvable in graph with bounded cliquewidth, but the con- 
stants involved are equally bad as in the case of graphs with bounded treewidth, 
as discussed in beginning of Sectional as in the case for graphs with bounded 
treewidth. Hence there is a linear time algorithm to decide fc-SuBCOLORABiLiTY, 
k fixed, for graphs of bounded cliquewidth. In the full version we also show that 
when k is part of the input, the fc-SUBCOLORABiLiTY is polynomial solvable for 
graphs of bounded cliquewidth. 

In this section we present a O(n^) algorithm to compute the subchromatic 
number of cographs (graphs of bounded cliquewidth at most 2). In particular, 
fc-SuBCOLORABiLiTY {k arbitrary) is efficiently solvable for cographs. 

Cographs [6] are inductively defined as follows. 

— Every single vertex graph is a cograph. 

— If Gi and G 2 are two (disjoint) cographs, then the disjoint union G\ U G 2 
and the join Gi -I- G2 are cographs. 

The join graph Gi -I- G2 is obtained from Gi U G2 by adding all edges between 
vertices in Gi and vertices in G2. 

With each cograph G we can associate a rooted binary labeled tree Tq, called 
the cotree of G. The leaves of the cotree Tq represent vertices of G. Internal nodes 
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of Tc have a label U or +. If G = Gi U G2, respectively, G = Gi + G2, then the 
root of Tq carries label U, respectively, label +, and its two sons are root nodes 
of the subtrees representing Gi and G2, respectively. 

There is a linear time algorithm for recognizing whether a given graph G is 
a cograph and, if so, for constructing a cotree Tq of G (see 0 ). 

Our algorithm for determining the subchromatic number of a cograph relies 
on the following notion. A subcoloring (j) oi & graph G is of type {i,j) if 4 > has 
exactly i color classes each of which induces a clique (called small classes) and 
exactly j remaining classes (called big classes). If 4 > is of type (i,j), we also call 
4 > an (i, j) -subcoloring. 

Consider an (i, j)-subcoloring of a graph G which arose by disjoint union 
U or by join + of two graphs Gi and G2. In the following, for each t = 1 , 2 we 
denote by (pt be the restriction of 4 > on Gj and assume that is of type (it,jt)- 

Lemma 8 . // G = Gi + G2 then for any (i, j)- subcoloring p of G it holds: 
i > max{ii, 12} and j = ji + j'2. 

Proof. The first equality follows from the fact, that any small class of 4 > may 
consist of at most two small color classes, one in <pi and one in <j)2- The second 
equality express the fact that big color class of p is big in exactly one of pi or 

p 2 - 



Lemma 9 . If pi and p2 are subcolorings of type (ii, ji) and (12,^2), then a 
(max{*i, 12}, ji j 2) -subcoloring of G = Gi G2 can be obtained from pi and 

p 2 - 



Proof. A (max{ii, 12}, ji+j2)-subcoloring of G can be obtained by a combination 
min{fi,i2} small classes of Gi and G2. 

Lemma 10 . // G = Gi U G2 then j > max{ji,j2}; * + j > + jt {t = 1 , 2 ), 

and i-\- 2 j >ii-\- ji + 12 + j2- 

Proof. Let 

— r denote the number of the big classes G of p such that, for each t = 1 , 2 , 
G n Gt is a big class in pt^ 

— rt denote the number of big classes G of p such that, for t t', G fl G* is a 
small class in pt but G fl Gf is a big class in pf , 

— qt denote the number of big classes of p belonging only to pt, 

— s denote the remaining big classes of p, which are small both in Gi and G2. 

— It denote the number of the small classes of p belonging to pt. 

The first statement of the Lemma follows directly from: 

* = ^1 + ^2, j = f + ri + r2 + gi + 52 + s, 

*i = ri + Zi + s, Ji=r + r 2 + gi, 

*2 = ?'2 + ^2 + S , J2 = + + 92 - 
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Moreover, i\ + j\ = i + j — h — 92 and 12 + 92 = * + 9 — ^1 — 9 i, hence the 
second statement holds. The third statement then follows from ii+ji + 12+92 = 
2(* + 9) - ^1 - ^2 - <?i - 92 = * + 2j - 9i - 92- 

In view of Lemmas [8] and | 9 ] we are interested in (i,9')-subcolorings with small 
number 9 of big classes. A way to obtain such a subcoloring of G = Gi U G2 
from (f)i and 4>2 is as follows: We first merge the min{9i,92} pairs of big classes 
of 4>i and 4>2, and then combine as many as possible of the \j\ — 92I remaining big 
classes together with some small classes into a new color class of 4>. The number 
of remaining small classes of 4>i is fci := A — min{ii,max{0,92 — 9i}}. Similarly 
4>2 contains k2 '■= %2 — min{i2, max{0,9i — 92}} remaining small classes. 

Note that k\ = i\ (if 91 > 92) or ^2 = *2 otherwise. Finally we combine fc, 
Q < k < min{fci,A:2}, small classes of (j)\ with k small classes of (j)2 and get a 
{k\ + k2~ 2k, max{9i,92} + fc)-subcoloring (j)' of G = Gi U G2. 

This and Lemmas El and E] suggest the following algorithm for determining 
Xs{G), assuming that the cograph G and its cotree Tq with root r are given. 

For each node x of Tq the algorithm stores in a table Tab^, the type (1,9) of 
all possible (i,9')-subcolorings of the induced subgraph Gx of G corresponding to 
X that are relevant for computing ys(G). The tables are computed in bottom-up 
order: 

( 1 ) For each leave x oiTc, put ( 1 , 0 ) into Tab^,. 

( 2 ) If a; has label + and sons X\,X2, then for all combinations of entries (zi,9i) G 
Taba;j and (12,92) G Taba;^ put into Taba, the entry (i,j), where i = max{fi, 12} 
and 9 = 91 +92. 

( 3 ) If X has label U and sons Xi,X2, then for all combinations of entries (ii, 91) G 
Taba,j and (^2,92) G Taba,2 perform the following computation: 

( 3 . 1 ) Set fci := ii — min{ii, max{ 0 , 92 —91}} and 
k2 ■■= i2 - min{i2,max{0,9i -92}}- 

( 3 . 2 ) For each k varying from 0 to min{fci,fc2} put into Taba, the entry (i,j), 
where i = k\ + k2 — 2k and 9 = maxjji, 92} + k. 

( 4 ) Return Xs(G) = min{i + 9 : (1,9) S Tab^}. 

Note that the number of all entries (i, 9) stored in each Taba, is bounded by 
Moreover, as discussed by Lemmas [8] and | 9 ] and after Lemma HOl if {i,j) G Taba, 
then there exists a subcoloring of Gx of type 

We say that + (*,9) if simultaneously holds that 9' < 9 and i' +9' < 

i + 9. It is clear that from a subcoloring of type {i' ,j') any subcoloring of type 
{i,j) with {i' ,j') + (*,9) can be derived by adding extra colors or claiming some 
small color classes as big. 

The following lemma shows the correctness of the algorithm (proof omitted) : 

Lemma 11 . For every node x o/Tq and every subcoloring (j) of Gx of type (i,j), 
there exists a pair (i',f) G Taba, such that {i',f) + (*,9)- 

The direct application of Lemma [TT] concludes the proof of the second part 
of Theorem El in time O(nfc^). 
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Abstract. There exists a polynomial time algorithm to compute the 
pathwidth of outerplanar graphs |3|, but the large exponent makes this 
algorithm impractical. In this paper, we give an algorithm, that given 
a biconnected outerplanar graph G, finds a path decomposition of G of 
pathwidth at most twice the pathwidth of G plus one. To obtain the 
result, several relations between the pathwidth of a biconnected outer- 
planar graph and its dual are established. 



1 Introduction 

Much research has been done to compute the pathwidth of graphs. The notion 
of pathwidth first appeared in the theory on graph minors by Robertson and 
Seymour [8], and is equivalent to several other graph theoretic notions, e.g., 
vertex separation number, interval thickness, node search number. See |2I1| for 
overviews. 

In this paper, we consider the problem to approximate the pathwidth of 
outerplanar graphs. In [3j, it was shown that the pathwidth of graphs with 
bounded treewidth can be computed in polynomial time. As outerplanar graphs 
have treewidth two, we know that pathwidth is polynomial time computable for 
outerplanar graphs. However, the exponent in the running time of this algorithm 
is rather large — already one step in the algorithm requires to work with sets 
of size 0{n^^). In this paper, we give a linear algorithm, which approximates 
the pathwidth of a 2-connected outerplanar graph with multiplicative factor 
two. Our algorithm is based on structural results on the relation between the 
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pathwidth of a 2-connected outerplanar graph and its dual, which are interesting 
in their own right. This ‘dual’ relation combined with the results of Ellis et al. 
^ that the pathwidth of trees can be computed in linear time are the main 
ingredients of our algorithm. Also, we show how to construct the corresponding 
path decomposition in O(nlogn) time. 

In [^, Govindran et al. give an 0(n log n) time algorithm for approximating 
the pathwidth of an outerplanar graph with a multiplicative factor of three. We 
improve upon this paper for biconnected outerplanar graphs. 



2 Definitions and Notations 

We use the following notations: G = {V,E) is an undirected and finite graph 
with vertex set V and the edge set E which is assumed to be without self-loops 
or parallel edges. A plane graph is a particular drawing of a planar graph in 
the plane without crossings. An outerplane graph is a planar embedding of an 
outerplanar graph with every vertex on the exterior face. Edges of an outerplane 
graph that are not on the boundary of the exterior face are called internal. If 
G = {V, E) is a plane graph then G* = (V*,E*) denotes its geometric dual. The 
weak dual of a plane graph G is the graph obtained from the dual G* by deleting 
the vertex corresponding to the exterior face of G. 

Observation 1 The weak dual of an outerplane graph is a forest. 



Observation 2 The weak dual of a 2-eonnected outerplane graph is a tree. 

The notion of pathwidth was introduced by Robertson and Seymour . (See 
1^ and [7| for surveys.) A path decomposition of a graph G = (V, E) is a sequence 
(Xi, AI 2 , . . . ,Xj.) of subsets of V, such that 

“ Ul<i<r ~ 

— For all {v, w} £ E, there is an i, 1 < i < r, with v,w G Xi. 

— For all 1 < io < ii < i 2 < r, Xi^ n C X^^. 

The width of path decomposition (Xi, X 2 , . . . , X^.) is maxi<i<r|Xi| — 1. The 
pathwidth of a graph is the minimum width over its path decompositions. 

The notion of treewidth is strongly related to the notion of pathwidth. In 
this paper, we use a variant of this notion, which we call semi treewidth. 

A semi tree decomposition of a graph G = (V, E) is a pair ({X^ | i £ /}, T = 
(/, F)), {Xi \ i G 1} a, collection of subsets of V and T = (/, E) a tree, such that 

-\J.^iXi = V. 

— For all {u,w} £ E, there is an z £ / with v,w G Xi, or there are nodes 

io,H S I with V £ X^^, w G X^^, and zq and i\ are adjacent in T. 

— For all zo, zi, Z 2 £ I, if ii is on the path in T from zq to Z 2 , then X^,, n X^^ C 
X,,. 
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The width of a semi tree decomposition {{Xi \ i £ I},T = {I, F)) is max^g/ \Xi\ — 
1, and the semi treewidth of a graph is the minimum width over its semi tree 
decompositions . 

The definition of tree decomposition is obtained by replacing the second 
condition in the definition above by 

— For all {v, w} £ E, there is an i £ I with v,w £ Xi. 

The treewidth of a graph is the minimum width over its tree decompositions. No- 
tice that path decomposition of a graph G can be defined as a tree decomposition 
with a tree T being path. 

3 Pathwidth of Outerplane Graphs 

The main purpose of this section is to prove the following theorem. 

Theorem 1. Let G be a 2-connected outerplane graph without loops and multi- 
ple edges, and let G* be the dual of G. Then: 

pw(G*) < pw(G) < 2pw(G*) -I- 2. 

Proof. By Lemma El pw(G*) < pw(G). By Lemma El there is a planar inner 
triangulation H of G such that pw{H*) < pw(G*) -I- 1. Notice that pw(G) < 
pw(iL). Applying Lemma|7]for H* we have that pw(iL) < 2pw(iJ*). So we have: 

pw(G*) < pw(G) < pw(iL) < 2pw(iJ*) < 2pw(G*) -1-2. 



□ 

The lemmas needed for the proof above will follow in the remainder of this 
section. We need the following fact about pathwidth of trees. 

Theorem 2 (Ellis et al. [4]L 

1. Every tree T of pathwidth k 1, k > 1, has a vertex u such that the forest 
T\ {u} has at least three connected components of pathwidth > k; 

2. For any tree T, pw(T) < k 1, k > 1, if and only if there is a path P such 
that every connected component of the forest T \ V{P) has pathwidth < k. 



Lemma 1. Let G* = {V* , E) be the dual and let T = (Vr, Eji) be the weak dual 
of a 2-connected outerplane graph G = (V, E). Then pw(T) -I- 1 = pw(G*). 

Proof. Let v £ V* be the vertex corresponding to the exterior face of G, he., 
T=G*\ M. 

Let us prove that pw(G*) > pw(T) -|- 1. Suppose that pw(T) = k. Then by 
Theorem El there is a vertex u £ Vr such that at least three branches at u have 
pathwidth > fc — 1. Let Ti,T 2 ,T^ be these branches. 
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Let (Xi, X 2 , . . . , Xr) be a path decomposition of G* . In this path decom- 
position there are sets Xi. containing at least k vertices of Tj, j G {1,2,3}, as 
the pathwidth of each Tj is at least fc — 1. Without loss of generality, we sup- 
pose that ii < i 2 < h- Choose a G Xi^ \ Xi^ and h G Xi^ \ Xi^. Then Xi^ is 
an (a, 6)-separator. But there are at least two vertex disjoint (a, 6)-paths in G* 
containing no vertex of T 2 (one passing through u and one through v.) Hence 
pw(G*) > (A: -I- 2) — 1 = fc -I- 1. 

The proof of pw(G*) < pw(T) -|- 1 is obvious because T = G* — v. □ 

Lemma 2. Let G be a 2-connected outerplane graph without loops and multiple 
edges. Then pw(G*) < pw(G). 

Proof. Let T = {Vt, Et) be the weak dual of G. By Lemma[Tl pw(T) = pw(G*) — 
1. We now will show that pw(T) < pw(G) — 1, using induction to the pathwidth 
of T. In the case pw(T) = 1 the result clearly holds. 

Suppose now that for every 2-connected outerplane graph having a weak dual 
of pathwidth < fc, the lemma is correct. 

Let G be a 2-connected outerplane graph and T be its weak dual with 
pw(T) = fc -I- 1. Then by Theorem there is a vertex u G Vt such that the 
graph T \ {u| has at least three components Ti,T 2 ,T^ of pathwidth > fc. Let 
Hi,H 2 , H 3 be the subgraphs of G having Ti, T 2 , T 3 as weak duals. These graphs 
are 2-connected and outerplane and by induction hypothesis pw(iJi) > fc -I- 1, 

{1,2,3}. 

Because G is a 2-connected outerplane graph without loops and multiple 
edges, the face of G corresponding to vertex u is bounded by a cycle G of length 
> 3. See Figured] for a schematic diagram. Notice that for every i G {1,2,3} 
the subgraph Hi has some vertices of G and there is a path pi in G from Hj 
to Hk avoiding Hi, i ^ j ^ k. Let (Xi,X 2 , . . . ,Xu) be a path decomposition 
of G*. In this path decomposition there are sets Xi^ containing at least fc -|- 2 
vertices of Hj, j G {1,2,3}. We may assume, without loss of generality, that 
ii < *2 < * 3 . Then Xi^ separates and X^j. Therefore Xi^ contains a vertex 
of p 2 and |Xi 2 |>fc-|-2-|-l = fc-l-3. □ 

A triangulation of a plane graph G is a maximal plane supergraph, i.e. a 
plane supergraph of G with every face (including the exterior face) a triangle. 

An inner triangulation of a plane graph G is a plane supergraph of G, such 
that all interior faces are a triangle, i.e., we allow the exterior face not to be a 
triangle, and no edges are added to the outerface of G. Notice that every inner 
triangulation is a maximal outerplane graph. 

Lemma 3. Let G be a 2-eonnected outerplane graph without loops and multiple 
edges. Then there exists an inner triangulation H of G such that pw(G*) > 
pw{H*) - 1. 

Proof. Consider the following face split operation on plane graphs: take an in- 
terior face that is not a triangle, and add an edge between two non-adjacent 
vertices of the face. Clearly, when we repeat the face split operation until it is 
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Fig. 1. Illustration to the proof of Lemma E] 



no longer possible, we end up with an inner triangulation of the original graph. 
It remains to be shown that we can do this such that the pathwidth of the dual 
graphs do not change. 

The face split operation on G corresponds to the following operation on the 
dual G*, or, similarly, to the weak dual T. Take a vertex v of degree at least 
4 in G*. Let N[v] be the set of the vertices that are adjacent to v. Assume a 
clockwise ordering of the neighbors of u, Ui, U 2 , . . . , Us- (The ordering is forced 
by the embedding, up to a cyclic shift.) Now, partition the set fV[u] into two 
sets M and N, where M consists of the vertices in some consecutive part of the 
ordering, i.e., M = {vi,Vi+i, . . . ,Vj}, l<i<j<s. M and N have size at 
least two. Now, transform G* as follows: delete v with all its incident edges, add 
new vertices u and w with an edge {u,w}, and make u adjacent to all vertices 
of M and w adjacent to all vertices of N. Notice that the degree of u in new 
graph is |A^| + 1 and the degree of w is |M| + 1. We say that the result of this 
transformation is obtained from G* by a vertex splitting of v. A graph H* is 
said to be a vertex split of G* if H* is obtained from G* by a sequence of vertex 
splittings. 

Notice that if G = {V^E) is an outerplane graph and F* = (Vp,Ep) is a 
vertex split of G* = (V*,E*) then F = {Vp^Ep) is outerplane and Vp = V, 
Ep ^ E. 

Let T be a weak dual tree of G. We wish to prove first that there is a split 
Ts of T such that pw(T) > pw(Ts) — 1 and A{Ts) = 3. 
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By Theorem |2] every tree of pathwidth < fc + 1 has a path P such that every 
branch at this path (connected component of T \ V{P)) has pathwidth at most 
k. Using this fact one can find the split Ts easily. In fact, choose the path P as in 
Theorem[^and split (if necessarily) the vertices of P. Since the new path has the 
same branches, such splittings do not increase the pathwidth. Then we can split 
the branches of P recursively, unless such a branch consists of a single vertex. 
Splitting a branch that is a single vertex means that the vertex is replaced by a 
path: this increases the pathwidth of the branch by one. 

If we do the corresponding face splits to G, then we obtain the desired inner 
triangulation H of G. In fact, by Lemma[TJ pw(G*) = pw(T) + I and pw(iJ*) = 
pw(Ts) + I. Therefore pw(G*) = pw(T) + I > pw(Ts) = pw{H*) — 1. □ 

Note that a graph remains outerplanar when we apply an inner triangulation. 

Lemma 4. Let Go be a graph, and let Gi be obtained from Go by removing all 
vertices of degree two whose neighbors are adjacent. Then pw(Gi) < pw(Go) < 
pw(Gi) + 1. 

Proof. As Gi is a subgraph of Go, clearly, pw(Gi) < pw(Go). 

Suppose we have a path decomposition (Xi, X 2 , . ■ . , Xj.) of Gi. For each 
vertex v in Gq with degree two whose neighbors w, x are adjacent, find a set 
Xi with w,x G Xi and add after Xi in sequence of the path decomposition a 
set Xi U {u}. This gives a path decomposition of Go whose width is at most one 
larger than the given path decomposition of Gi. □ 

Lemma 5. Let H be a 2-connected inner triangulated outerplane graph. Let T = 
{Vt,Et) be the weak dual of H . Let H~ be the graph, obtained by removing all 
vertices of degree two from H. Then there is a semi tree decomposition {{Xi \ i G 
Vt},T) of H~ with width 1. 

Proof. Choose an arbitrary leaf node vq from T, and view T as & rooted tree 
with root Uq. We take = 0. For all nodes w ^ Vq, consider the edge from w 
to its parent in T. This edge is dual (crosses in the diagram) an edge, say {y, z} 
from H. Then take X^, = {y,z}. 

We claim that {{Xi \ i G Vr},T) defined in this way is a semi tree de- 
composition of H~ . We will verify the second and third condition of semi tree 
decomposition; the first then follows directly from the second. 

Consider an edge {y, z} from H~ . If {y, z} is an internal edge, then it crosses 
an edge say {u, w} from T. Suppose v is the parent of w in T. Then, by construc- 
tion, {y,z} = Xu,. Now, suppose {y,z} is not an internal edge. Suppose {y,z} 
is adjacent to an internal face, represented by a vertex v from T. As the graph 
is triangulated, this face must also be a triangle. The face must be adjacent to 
two other internal faces in H otherwise y or z would have degree two in H, and 
hence {y,z{ would not be an edge in H~ . Say these faces are represented by 
nodes w and x. As the root of T is a node of degree one in T, either w or x is 
a child of u in T. Suppose u> is a child of u in T. Then y G X„ and z G X^, or 
z G Xy and y G Xy,. (See diagram [21) 
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Fig. 2. Illustration to the proof of Lemma [5] 



Finally, for a node z in note that the edges from T that cross an edge 
with z as endpoint form a path in T . By the choice of a leaf node as root of T, 
this is a directed path and hence the collection of sets that z induces a path 
in T. □ 



Lemma 6. Let {X, T) be a semi tree decomposition of a graph G = {V, E) with 
width k. Then, 

pw(G) < {k + l)(pw(T) + 1 ) — 1 . 



Proof. Let (Yi,l2, ■ ■ • ,Yr) be a path decomposition of T of pathwidth pw(T). 
Consider the sequence {Z\, Z2 , . . . , Z^) with Zi = U^gy, 

We now will verify that this sequence is a path decomposition of G. First, 
suppose < Z2 < *3) and v G Zi^ n Hence, v S Xj^ for some j\ G Yi ^ , and 

V G for some j’3 G Yi^. By the properties of path decomposition, we have 
that must contain a node from the path from j\ to j’3 in T ; call this node j’2 • 
By the definition of semi tree decomposition, v G , and hence v G Zi,^ . 

Next, consider an edge G E. If there is an edge {11,12} in T with 

V G and w G Xi.^, then there must be an j, 1 < } < r with i\,i2 G Yj. Hence, 
v,w G Zj. The case that there is an i G / with v,w G Xi is easy. 

One directly sees that the width of this path decomposition is at most (fc + 
l)(pw(T) + 1) - 1. □ 



Lemma 7. Let El be a 2 -connected inner triangulated outerplane graph with 
dual H* . Then pw(iL) < 2 • pw(iJ*). 

Proof. Let H~ be the graph obtained by removing the nodes of degree two from 
H . Note that, as H is triangulated, the neighbors of every vertex of degree two 
in H are adjacent. By LemmaS] pw(iJ) < pw(iL“) + 1 . Let T be the weak dual 
of H . By LemmaEl there is a semi tree decomposition {X, T) of H~ of width 1 , 
hence by Lemma E] pw(iL“) < 2 • pw(T) + 1 . Now 

pw{H) < pw{H~) + 1 < 2 • pw(T) +2 = 2 - pw(iJ*) 



□ 
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4 An Approximation Algorithm 

for Biconnected Outerplanar Graphs 

In this section, we give an algorithm, that given a 2-connected outerplanar graph 
G = (V,E) with n vertices finds a path decomposition of G that has a width that 
is at most two times the pathwidth of G plus 2. The algorithm uses 0(n log n) 
time, and follows the structure of the proof, given in the previous section. Note 
that if we just want to have a bound on the pathwidth of G, then the algorithm 
can run in linear time. 

Step 1: Remove loops and parallel edges. If we allow our input graph to have 
self-loops (edges of the form {u,u}), or parallel edges, then we just remove such 
edges: this does not change the pathwidth of the graph. 

Step 2: Compute an outerplane embedding of G. It is well known that given a 
graph G* , one can test in linear time if G* is outerplanar, and if so, find an 
embedding of G* with all vertices on the exterior face. See e.g. 

Step 3: Compute the dual graph G* of G. It is well known, that, given a plane 
embedding of a planar graph, one can find its dual in linear time. 

Step 4- Compute the pathwidth of the dual graph G* . The pathwidth of the 
dual graph G* can be computed in linear time, in the following way. First, we 
take the tree T that is obtained from G* by removing the vertex v representing 
the exterior face, i.e., T is the weak dual of G. Using the algorithm of Ellis 
et al. [1], we can compute in linear time the pathwidth of T, and compute 
a path decomposition (Xi, . . . , Xr) of T of optimal width in O(nlogn) time. 
The pathwidth of G* is one larger, by Lemma H] and the path decomposition 
(XiU{u}, X 2 U{u}, . . . , Xj.U{u}) is a path decomposition of G* of optimal width. 
By Euler formula the number of vertices in G* is 0{n) and the pathwidth of G* 
can be computed in 0{n) time. 

If we are satisfied with a bound on the pathwidth of G that is at most two 
times plus 2 larger than the exact bound, then, by Theorem [I] we are done, as 
2pw(G*) -k 2 is such a bound. However, if we want a path decomposition of G 
whose width at most two times plus 2 than the pathwidth of G, then more work 
has to be done. 

Step 5: Compute an inner triangulation H of G, such that pw(iJ*) < pw(G*)-|-l. 
In this step, we make the proof of Lemma El constructive, i.e., we compute an 
inner triangulation H of G, such that the pathwidth of G* equals the pathwidth 
of H*, where H* is the dual of H. 

Let T be again the weak dual of G. Suppose the pathwidth of T is fc. 

Now, first we find a path P in T, such that all branches at this path have 
pathwidth at most k. This can be done in linear time, using a minor modification 
of the algorithm from Ellis et al. [3] . When we have the path P, the split can easily 
be applied. After this, we continue recursively on the branches. As the pathwidth 
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of a tree with t vertices is 0(log t), every edge of T is involved in 0(log t) recursive 
steps, and the total time of this step becomes 0{t\ogt) = O(nlogn). 

Step 6: Compute a path deeomposition for the weak dual of H . Compute the 
weak dual T of the inner triangulation computed in the previous step, and 
use the linear time algorithm of Ellis et. al. |4] to compute an optimal path 
decomposition of T. 

Step 7; Compute the semi tree decomposition of H~ . Let H~ be the graph 
obtained by removing the vertices of degree two from H . Following construction 
described in the proof of Lemma [5l we can make a semi tree decomposition 
(X, T) of width 1, with T the weak dual of H. It is not hard to see that this step 
can be done in linear time. 

Step 8: Make a path decomposition of H~ . The proof of Lemma can be made 
constructive in a straightforward way; we thus obtain a path decomposition of 
H~ of width 2pw(T) + 3 = 2pw(iJ*) + 1 of H~ . 

Step 9: Make a path decomposition of H . Finally, we have to add back the 
vertices with degree two, while increasing the pathwidth with at most one, as in 
the proof of Lemma [H This can be done in linear time by using the following 
method: 

Suppose that the vertices in H are numbered v\, ... ,Vn- We use also a boolean 
array Z with indexes {1, . . . , n}. Initially, all Z[i] are false. 

1. Suppose that S is the set of vertices of degree two. Make a set P of (ordered) 
pairs of vertices, with for every v G S, its two neighbors as two ordered pairs 
in P, i.e., if v’s neighbors are x and y, then both (x,y) and (y,x) belong to 
P. Maintain pointers from and to a vertex v G S and its corresponding pair 
in P. 

2. Radix sort P, i.e., P is sorted with respect to the lexicographic ordering. 
This can be done in linear time, using the standard radix sort algorithm. 

3. Now, we can build an array, with the ith entry pointing to a list of vertices 
V for which {vi,v) belongs to P. This can be done in linear time, using the 
sorted list of the previous step. 

4. Next, visit the bags in the path decomposition of H~ one by one. For each 
bag Xi, we do the following: 

(a) for every vj G Xi, make Z[j] true. 

(b) for every vj G Xi\ Xi-i and for every Vj/ with (vj,Vji) G P, check if 
Z[j'] is true. If it is true, then we found the bag for the neighbors of a 
vertex v, namely the one corresponding to the pair (vj,Vji). We remove 
this pair and its reversed copy {vji, Vj) from the array of step|3] and add 
a pointer from v to bag Xi. 

(c) for every Vj G Xi, make Z[j] false, i.e., now all values in Z are again 
false. 
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5. Now, for every vertex v of degree two in H , we can follow the pointer from v 
to a bag Xi. Add after Xi a bag which contains X^ U {z;}. Note that in this 
way, we never create bags that are more than one larger than a bag in H~ . 

When (n, w) and {w, v) belong to P, then consider the first bag that contains 
both V and w. In this bag, we either will consider z; or zc in step 14b I and hence 
the vertex of degree 2 associated with this pair will be pointing towards this bag. 

The time is linear in the size of the path decomposition plus the size of H. 
The pathwidth of a tree with k vertices is 0(log k) and by construction the 
number of vertices in a path decomposition is 0{n). Therefore this step can be 
performed in 0{nlogn) time. 



Step 10: Obtain a path decomposition of G. The path decomposition of H ob- 
tained in Step 9 is also the path decomposition of G and the width of this 
decomposition is at most 2pw(G) -I- 2. 



5 Concluding Remarks 

One of the most interesting question about pathwidth of outerplanar graphs 
we left open is the existence of fast practical exact algorithms or algorithms 
approximating pathwidth of outerplanar graphs with an additive factor. One 
of the possible ways in obtaining such algorithms is the proof of the following 
conjecture. 

Conjecture 1. There is a constant c such that for any 2-connected outerplanar 
graph G without loops and multiple edges pw(G*) < pw(G) < pw(G*) -I- c. 

Moreover, we suggest that 

Conjecture 2. For any 2-connected planar graph G without loops and multiple 
edges pw(G*) - 1 < pw(G) < pw(G*) -f 1. 

Path decompositions of trees and of outerplanar graphs with n vertices can 
have 0{n) bags Xi of size 0(logn), thus, when using a straightforward rep- 
resentation one may need already l7(nlogn) time just for writing the output. 
However, more compact representations of path decompositions exists, e.g., mark 
for each vertex the first and last bag it belongs to, or one can use the equiva- 
lent notion of vertex separations. These representations have size linear in the 
number of vertices. As Skodinis |S] has shown that (with such representations) 
one can find an optimal path decomposition of a given tree in linear time, we 
conjecture that the algorithm of Section 2] can be made to run in linear time, 
but there are several unresolved matters in this, and we leave this as an open 
problem. As a side remark, we note that the algorithm of Govindran et al. [S] 
can be made to run in linear time, using Skodinis’ algorithm and a corresponding 
representation of the path decompositions. 
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Abstract. Submodular functions have appeared to be a key tool for 
proving the monotonicity of several graph searching games. In this paper 
we provide a general game theoretic framework able to unify old and 
new monotonicity results in a unique min-max theorem. Our theorem, 
provides a game theoretic analogue to a wide number of graph theoretic 
parameters such as linear- width and cutwidth. 



1 Introduction 

A considerable part of graph theory is oriented to the following general problem: 
Given a non-trivial graph property P, find a complete characterization of the 
graphs that do not satisfy P. In general, such a characterization is achieved by 
describing some “forbidding” structure whose existence in G obstructs P from 
being satisfied. As a first example we mention the Kuratowski theorem asserting 
that a graph is non-planar iff it topologically contains an obstructing structure of 
two forbidden graphs. Many graph theoretic parameters have been characterized 
by their obstructing analogue and this characterization is typically achieved by 
a so called min-max theorem (for a good source of min-max theorems, see 0). 
Several examples of such characterizations emerge from the work of Robert- 
son and Seymour on their Graph Minors series. As a sample, we mention the 
characterization of treewidth via screens j2l], of branchwidth via tangles [20] , 
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of pathwidth via blockages jS], and of carving- width via antipodalities [22] (see 
also lisTTT] ). 

In many cases, it appears that min-max theorems are useful on the study 
of graph searching games. Roughly, a graph searching game has as objective 
to capture an omniscient fugitive residing on the vertices or/ and the edges of 
a graph by systematically moving a specific number on searchers on it. Such 
a search strategy can in general allow “recontamination” in the sense that it 
can let the fugitive occupies parts of G that have already been searched. If this 
is not the case the strategy is called monotone. The “monotonicity question” 
for a graph searching game asks whether “allowing recontamination” makes the 
capturing of the fugitive easier than in the case where the player restricts his/her 
attention only to monotone strategies. 

It frequently appears that the monotonicity question can be answered with 
the use of a suitable min-max theorem. The reason is that, in the majority of 
the cases, the minimum number of searchers required to capture the fugitive is 
equivalent to some known graph theoretic parameter. The existence of a min- 
max theorem for this parameter implies the existence of a forbidden structure 
that, in turn, indicates an escape strategy for the fugitive no matter whether 
the searching strategy is monotone or not. That way, the min-max theorems 
for treewidth and pathwidth, developed in m and [2], were the cornerstones 
for proving the monotonicity of the corresponding graph searching variants (see 
also j7j). 

Our paper is motivated and constitutes an extension of the ideas in the proofs 
of the monotonicity of the agile fugitive search games examined in 1181141181231^ . 
IM6] as well as the proofs of the min-max theorems in and [^. Our 
main observation is that the kernel argument of all these proofs is based on 
the fact that, in any game variant, the cost of the search can be expressed 
by a connectivity function that is a nonnegative-valued function on the set of 
subsets of a set M that is invariant over complement and satisfies the submodular 
property. 

In this paper we show how a graph parameter can be generated by a con- 
nectivity function and we prove a general min-max theorem for it. Moreover, we 
define for any such parameter a one-player conquest game and we use our min- 
max theorem in order to prove its monotonicity. Our game framework is general 
in the sense that it provides a big variety of games depending on the choice of the 
connectivity function a that generates it. In particular, for suitable choices of a, 
it provides obstruction characterizations, game counterparts, and monotonicity 
proofs for the parameters of linear- width, cutwidth and their extensions. Finally, 
our general min-max theorem implies in a uniform way the monotonicity proofs 
of all the agile fugitive search games developed so far in |1,8l18l28l4l1bj . 

To illustrate the main motivation of our research let us give a simple example 
of an “expansion” game. Suppose that we have a set of countries subject to 
join some organization. At every moment of time we can either add a bounded 
number of countries to the union or expel an arbitrary number of countries. 
Adding countries and keeping them in the union needs some resources, say for 
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guarding its border from the countries outside the union. The question is how 
to add all the countries using the minimal amount of resources. 

There is a number of interesting questions concerning this game. One of them 
is the “monotonicity question” : can it be useful at some step to expel countries 
from the union? Another is the “min-max question” : what kind of structure in 
a map provides necessary and sufficient conditions for obstructing the intended 
expansion? In this paper we provide the answers to both questions for a more 
general version of expansion game. 

The paper is organized as follows. The main min-max theorem is proved in 
Section[2 In Section[^we present our general game, and in Section [^we present 
the consequences of the min-max theorem on its variants. In Section [SI we end 
up with further examples, remarks, and open problems. 

2 A Min-max Theorem for Connectivity Functions 

Given a finite set M, a function a mapping the subsets of M to integers is called 
a connectivity function for M m if the following two conditions are satisfied. 

'^AGM ci{A) = a{A) (we denote A = M — A), (1) 

'^A,BCM o:{A U B) a(A n B) < cr(A) -|- cxi^Bf (2) 

Notice that for any X C M, a{X) > a(0) because 2a(X) = a{X)+a{M—X) > 
a(0) + a(M) = 2a(0). It is more convenient for our purposes to think that 
a(0) = 0 (replacing a'(X) = a(X) — a(0) if it is not the case). 

For a given a finite set M, let O be a set of subsets of M and let a be a 



connectivity function for M. A sequence A = {Ai, ... ,Aj.) is a 
in O (for M with respect to a connectivity function a) 


{k^m) -expansion 


Vi,i<i<r AiC M and a{Ai) < k, 


(3) 


Ai,Ar G O, 


(4) 


'^iA<i<r—l A^\ ^ m. 


(5) 


If, additionally. 




A.[ G 


(6) 


then we say that A is monotone. Finally, a (A:, m) -expansion in {0}, i.e. expan- 
sion with Ai = 0 and Aj. = AI, is referred to as complete. 

For facilitating the notation, in this section we will assume that all the expan- 
sions are defined with respect to a fixed connectivity function a. The following 
lemma uses the ideas of Bienstock-Seymour’s monotonicity proof for graph cru- 



sades H]. 

Lemma 1. If there is a {k,m)- expansion in O for M then there is a monotone 
(k,m)- expansion in O for M. 
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Proof. The proof of this lemma is omitted due to space restriction. It can be 
found in the full version of the paper [TO] . □ 



Lemma 2. Let A = (Ai,A 2 , . . . ,Ar) be a monotone {k,m) -expansion for M. 
Then A = {Ar, A^-i, . . . ,Ai) is also a monotone (fc, m)-expansion for M. 

Proof. Since Ai = Ai and a{Ai) = a{Ai), we have that A satisfy (EJ and (H]). 
For i S {1, ... ,r — 1}, A A^ implies that A^ — Ai^i = — A^ and 

Ai+i C Ai. Therefore, © and (|bj) are also satisfied. n 

For any integer fc, we define a {k,m)~ obstacle for M as the set O such that 

(01) Each A G O is a subset of M with a{A) < k. 

(02) li A G O, B Q A and a{B) < k then B G O] 

(03) If A,B,C C M, An B = 0, a{A) < k, a{B) < k, \C\ < m, and AUBUC = 
M then {A G O A B ^ O) or {A ^ O A B G O). 

The aim of this section is to prove that the existence of a {k, m)-obstacle for 
M, obstructs the existence of a complete and monotone {k, m)-expansion for M 
and vice versa. In particular, we will prove the following min-max theorem. 

Theorem 1. Let M be a finite set and a a connectivity function on M . Then 
the following are equivalent. 

(i) There exists no (k, m)- obstacle for AI . 

(ii) There exists a complete (k^m) -expansion for M . 

(Hi) There exists a complete and monotone {k, m)- expansion for AI . 

For the proof of Theorem [1] we need first to prove Lemma [3] below. A set 
O is partial (k,m) -obstacle for AI if it satisfies (ol) and (o2) and if there is no 
(fc, m)-expansion for it. The next lemma is an analog of the blockage theorem 
(2.5) in [ 3 ]. 

Lemma 3. Every partial (k,m)- obstacle for AI is a subset of a (k,m)- obstacle 
for AI. 

Proof. Let O be a partial (fc, TO)-obstacle for AI. We assume that every O', 
where \0'\ > \0\ and satisfying (ol) and (o2), either is not a partial {k,m)- 
obstacle for AI, or is a subset of a {k, m)-obstacle for AI. As base of the induction 
we can consider a set Oq containing any subset A of AI, where a{A) < k. 
Indeed the set Oq is not a partial (fc, m)-obstacle. In fact, 0,M G Oq because 
a{0) = a{AI) = 0 < k and (0) is a (A:, m)-expansion in Oq. 

If O satisfies (ol), (o2) and (o3) then O is a (A:, m)-obstacle for AI and the 
lemma holds. Suppose that O does not satisfy (o3). Then there exist Ai,A 2 ,C C 
AI such that 



— a{Ai) < k, 0 (^ 2 ) < k, \C\ < m; 

- Ai U A 2 U C = M and Ai n A 2 = 0; 
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— Ai,A 2 G O or Ai,A 2 ^ O. 

For our purpose it is more convenient to work with complements of Ai and A 2 - 
Let Ti = Ai and T 2 = A 2 - Notice that 

ct(T'i) < k and a{T 2 ) < k, 

TiUT2 = M, 
iFi n T 2 | < m 

We claim that 

Ti,T2^0. ( 10 ) 

Indeed, notice first that, from m, Ai n A 2 = 0 and thus T\ D A 2 - By (o2), 
if Ti G O then A 2 G O. Since A 2 G O, we have that Ai G O. Then (Ai) is a 
(fc, m)-expansion in O, which is impossible. The proof of T 2 ^ O is similar. 

We now choose Ti , T 2 satisfying m-m such that |Ti| is minimal. We claim 
that 



(7) 

( 8 ) 
(9) 



If X C Ti and a{X) < k then either X £ O, or X = Ti, or Ti £ O. (II) 

We assume that X ^ O and X ^ T\ and we will first prove that X £ O. For 
this, we will show that if X ^ O, then X, X satisfy ©-dinj, contradicting the 
choice of Ti, T 2 . Indeed, (E} follows from a(X) < k and (ED; (ED is obvious and (E]) 
follows as X n X = 0. We now conclude the proof of m noticing that Ti C X 
and (o2) gives Ti £ O as required. 

For z = 1, 2 we define Oi as the set of all X CT^ with a(X) < k. Notice that, 
for i = 1,2, Ti ^ O implies \OUOi\ > \0\. Therefore, the result follows from the 
induction hypothesis if we show that for some i = 1, 2, OUOi is a partial (fc, m)- 
obstacle for M. Clearly, (ol) and (o2) are satisfied for both O 0 Oi,i = 1,2 and 
therefore, it remains to prove that, for some z = 1, 2, there is no (fc, m)-expansion 
in OUOi- Assume, towards a contradiction, that, for z = 1,2 there are {k,m)- 
expansions in O U Oi and O U O 2 and, by Lemma [T] we can also assume that 
they are monotone. 

Let us prove that 

There is a monotone (fc, m)-expansion X = (Xi, X 2 , . . . , X^) in Oi (12) 

with Xi = Ti and X^ G O. 

llTi £ O then X = (Ti) is a (fc, m) -expansion in Ox satisfying fl1 211 . 

Suppose that Ti ^ O. Let X = (Xi, X 2 , . . . , X^) be a monotone {k,m)- 
expansion in OUOi. Then Xi, X^ G Ox. By LemmaEl X = (X^, X^-i, . . . , Xi) 
is also monotone (fc, m)-expansion in O0 Ox- 

One of the sets Xi and X^ is not in O because there is no (fc, m)-expansion 
in O. W.l.o.g we can assume that Xi ^ O. (Otherwise we can consider X.) As 
Xi G O U Ol, we have that Xi G Ox and, from the definition of Ox, we conclude 
that Xi C Ti. By (fTTH . (because Xi ^ O and Tx ^ O) Xx = Tx- 
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From the monotonicity of X it follows that T\ = X\ Q Xr- We claim that 
either X^ ^ Oi or X^. = 0. Indeed, if X^. G Oi, then the definition of Oi implies 
that Xr C Ti. Moreover, Ti C Xr implies that Xr C Ti and Xr = 0 follows. 

Notice now that GSl holds trivially if Xr = 0 G O. Moreover, if Xr ^ 0\, 
then Xr G OU Oi implies that Xr G O and this concludes the proof of (USD- 
Let 3^ = (Fi, > 2 , ■ • • , Ys) be a monotone {k, m)-expansion in 02- Since there 
is no (fc, m)-expansion in O we have that either Yi or is not in O. W.l.o.g. we 
can assume that Yi ^ O (otherwise we replace 3^ by 3^). Then Yi QT 2 , yielding 
iW-WiI < |T 2 -Ti| <_m. 

We now claim that Yr ^ O. Indeed, if this is not the case, then 
(Xr,Xr-l,...,X^,Yi,Y2,...,Y,) 

is a (A:, m)-expansion in O, contradicting to the fact that there are no (k,m)- 
expansions in O. 

Recall that Yg G OUO2 and, as Yg ^ O, we get that Yg G O2, thus Yg C T2- 
But Yg 0 T 2 combined with ©, implies |Xi - W| = |Ti - W| < |Ti - T 2 I = 
\Ti r\T 2 \ < m. As a consequence, 

(Xr,Xr-l,...,X^,Yi,Y2,...,Yg,XuX2,...,Xr) 

is a (fc, m)-expansion in O. This is again a contradiction and the lemma is proved. 

□ 



We can now proceed with the proof of Theorem [H 

Proof (Proof of Theorem QJ). The fact that (ii) (iii) follows directly from 
Lemma [H 

Next, we will prove that (Hi) => (i). Suppose, in contrary that O is a {k,m)- 
obstacle for M and A = (^ 1 , . . . , Ar) a complete and monotone (fc, m)-expansion 
for M. Recall that from dS]) we have that a{Ai) < k and from (o2), 0 = Ai G O. 
(o3) now implies Ar = M ^ O. Let i be the smallest z, 0 < t < r such that Ai+i ^ 
O. Let Ai+i — Ai = C and (0 gives \C\ < m. Notice that AiVJCVJ = M. 
Moreover, AiHAi+i = 0 and therefore (o3) implies that Ai ^ 0,a contradiction. 

It remains now to prove (z) => (ii). It is enough to show that there exists 
some (A:, m)-expansion in {0}. Indeed, if this is not the case and since {0} 
satisfies (ol) and (o2), we get that {0} is a partial (A:, m)-obstacle for M. From 
LemmalU {0} is the subset of some (A:, m)-obstacle for M, a contradiction. □ 

3 A General Framework for Conquest Games 

In this section we will introduce a general framework of a game on graphs, 
hypergraphs and sets based on the notion of (A:, m) -expansions. 

In particular, we assume that a is a connectivity function for some set M . 
The elements of M represent countries to be conquested. In the beginning, all 
the elements of M are considered unoccupied. The conquest proceeds in steps. 
Each step can either be the occupation of at most m new elements of M or the 
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retreat from some already occupied elements of M . We define a move as a pair 
P = where S' is a set of elements of M and oj equals 0 when these elements 

are removed from the set of conquested elements (retreat-move) and 1 when these 
elements are added to the set of conquested elements (attack-move). A conquest 
strategy for M is a sequence £ = (Pi , . . . ,Pr) of moves. The aggressivity of a 
conquest strategy is max{|S| | (S, 1) G £}, i.e. the maximal number of elements 
occupied at some step. Given a conquest strategy S, we recursively define the 
occupation sequence of £ as the sequence (Tq; • ■ ■ j T^.) where Tq = = Ti_iUS 

if Pi = {S, 1), and Ti = Pi_i — S' if Pi = (S, 0). The a-cost (or simply cost when 
there is no doubt about a) of a conquest strategy £ is defined as maxo<i<r ca{Ti). 
We call a conquest strategy monotone if it does not contain any retreat move. 
We call a conquest strategy successful if Tr is the set of all the elements of M. 

Lemma 4. For any set M and any connectivity function a, there exists a suc- 
cessful (monotone) conquest strategy with a-cost k and aggressivity m if and only 
if there exists a complete (monotone) {k, m)- expansion for M with respect to a. 

Proof. Let (Ai, . . . , Ar) be a complete (monotone) (fc, m)-expansion for M with 
respect to a. We construct a sequence of moves £ = (P{, Pi, . . . , P)[_i, Pj._i) 
such that for every 1 < * < r — 1, 

(p! p \ / ((^i ~ 0)) ~ 1))> if Q^(^i-i LI ^i) ^ 

Kt'pn) - I 0)), if a(A,_i U A) < k. 

Notice that, Q and the fact that a{Ai) -\- a{Ai+i) < 2k imply that either 
a(Ai_i n Ai) < k or a(Ai_i U Ai) < k. Therefore, £ is well defined. It now 
follows directly from the definitions that S is a successful (monotone) conquest 
strategy with a-cost k and aggressivity m. Given now a successful (monotone) 
conquest strategy for G with a-cost k and aggressivity m, it is easy to see that its 
occupation sequence is a complete (monotone) (fc, m)-expansion for V{G) with 
respect to the function a. □ 

Given a set M and a connectivity function a, we define a (A:, m)-ordered 
partition of M as a linear ordering (Pi, . . . , Bj.) where {Pi, . . . , P,.} is a partition 
of M, Vyi<i<r \Bi\ < m, and Vyi<i<r. a(Pi U • • • U Pi) < k. For any set M, we 
define the {a, m) -width of M as the minimum k for which there exists a {k,m)- 
ordered partition of G. 

We can now conclude this section with the following. 

Theorem 2. Let M be a set, a he a connectivity function for M, and k,m two 
integers. The following assertions are equivalent. 

1. There exists a complete {k,m)- expansion for M with respect to the function 
a. 

2. There exists a complete monotone (k,m)- expansion for M. 

3. There exists no {k,m)- obstacle for M with respect to the function a. 

4-. The (a,m)-width of M is at most k. 
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5. There exists a successful conquest strategy for G with aggressivity m and 
a-cost k. 

6. There exists a successful monotone conquest strategy for G with aggressivity 
m and a-cost k. 

Proof. The equivalence of (1), (2), and (3) follows directly from Theorem [T] The 
equivalence of (1) and (5) as well as the equivalence of (2) and (6) follows from 
LemmalU It remains to prove that (4) and (6) are equivalent. Let (Vi, . . . , Vr) be 
a (k, m)-ordered partition of I^(G). We construct E = ((S'!, 1), ■ ■ ■ , {8^., 1)) where 
for i = Si = Vi and observe that £ is a complete monotone {k,m)- 

expansion for V(G), as required. Finally, ii £ = ((^i, 1), . . . , {Sr, 1)) is a success- 
ful monotone conquest strategy, then it is enough to notice that (5'i, . . . , Sr) is 
a (fc, m)-ordered partition of V{G). □ 

4 Applications 

In this section we will give some examples of games on graphs where the conquest 
game framework introduced in Section [2] can provide a min-max theorem. 

4.1 Definitions 

We give first general definitions of some well known width-type parameters for 
graphs. 

The cutwidth of graphs has been extensively considered and emerged as a 
tool for the study of VLSI layouts (See [1 11 7j for further references) . We give 
below a natural generalization of its definition. 

Let iJ be a graph with the vertex set V{H) and the edge set E{H). For 
X C V{H) let ao(-^) be the number of edges incident to vertices in X and 
V {H)—X. Let (j) = {Bi, B 2 , ■ ■ ■ , Bn), maxi<i<„ \Bi\ < m, be an ordered partition 
of V{H). For 1 < i < n we put Vi = Wj^^Bj and 

m-cw{H,(j))= max a^fiVi) 

ie{l,...,n} 

The m-cutwidth of H is min{m- cw(iJ, (f): (f is an ordering of V {H)}. So, in our 
terms, the m-cutwidth of a graph H is equivalent to its {uq , m) -width. If in the 
definition above we set m = 1, we have the definition of cutwidth. 

The notion of linear- width for graphs was introduced by Thomas |2tiJ . Let 
G be an undirected and finite graph with vertex set V {G) and edge set E{G). 
For X C E{G), let a\{X) be the number of vertices incident to edges in X and 
E{G) — X. Let a = (ei, 62 , . . . , Cm) be an ordering of E{G). For i £ {!,..., m} 
we put Ei = Ut^iCj. We define 

lw(G,tT);= max ai{Ei), 

and the linear-width of G is min{lw(G, a): cr is an ordering of E{G)}. In a sim- 
ilar way, one can define the notion of linear-width for hypergraphs. In our terms 



Monotonicity of Games and Symmetric Submodular Functions 



185 



the linear- width of G is the -width of G. Certainly, it is possible to ex- 

tend the linear-width to m-linear-width considering ordered partitions of edges 
in the fashion we did for m-cutwidth. This would define a parameter equivalent 
to (ai,m)-width. Notice that both definitions given in this subsection can be 
generalized to hypergraphs and can have various extensions based on a weight 
assignment function for the vertices or the (hyper)edges of G. Any of these ver- 
sions are equivalent to some (a, m)-width for suitable choices of M, a and m. 



4.2 A Game for Cutwidth 

We consider first the case where M is the vertex set of a graph G and a maps any 
of its subsets S to the number of edges with endpoints in both S and V{G) — S. 
That way, (a, m)-width is the m-cutwidth of G and Theorem [I] provides an 
obstruction characterization and a min-max theorem for m-cutwidth (which in 
case m = 1 is simply the cutwidth of G). For a more intuitive approach, we 
can consider the vertices of G as the countries of the world where existence 
of “common borders” between two countries implies the existence of an edge 
between the corresponding edges. The aggressivity of the game indicates how 
many countries are permitted to be occupied after each attack. The cost function 
a may indicate the resources the player can use in order to keep the occupied 
positions. For a more realistic scenario, we can assign a cost on each of the 
edges indicating the length of the corresponding border portion or simply the 
cost of guarding it. It is easy to check that, in any case, the cost function is a 
connectivity function and, therefore. Theorem [T] indicates that the player can 
restrict his/her attention only to monotone conquest strategies (those that do 
no contain retreats) as this does not imply any deterioration of his/her ability 
to take over the world. 



4.3 Linear-Width and Search Games on Graphs 

Recall that, using the terminology of the previous sections, the linear width of 
a graph G is equivalent to its (oi, l)-width. Theorem [I] provides an obstruc- 
tion characterization and a conquest game for linear- width. In what follows, we 
will show how this conquest game can modelize the three variants of the graph 
searching game. 

A mixed searching game is defined in terms of a graph representing a system 
of tunnels where an agile and omniscient fugitive with unbounded speed is hidden 
(alternatively, we can formulate the same problem considering that the tunnels 
are contaminated by some poisonous gas). The object of the game is to clear all 
edges, using one or more searchers. An edge of the graph is cleared if any of the 
following occurs. 

A: both of its endpoints are oecupied by a searcher, 

B: a searcher slides along it, i.e., a searcher is moved from one endpoint of 
the edge to the other endpoint. 
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A search is a sequence containing some of the following moves: (i) placing a new 
searcher on v, (ii): deleting a searcher from v, (iii): sliding a searcher on v along 
{u,m} and placing it on u. 

The object of a mixed search is to clear all edges using a search. The search 
number of a search is the maximum number of searchers on the graph during any 
move. The mixed search number, ms(G), of a graph G is the minimum search 
number over all the possible searches of it. A move causes recontamination of an 
edge if it causes the appearance of a path from an uncleared edge to this edge 
not containing any searchers on its vertices or its edges. (Recontaminated edges 
must be cleared again.) A search without recontamination is called monotone. 

The node (edge) search number, ns(G) (es(G)) is defined similarly to the 
mixed search number with the difference that an edge can be cleared only if A 
(B) happens. 

The following is a combination of results in |3] and m- 
Theorem 3. For any graph G the following hold: 

— If G^ is the graph occurring from G after subdividing its pendant edges, then 
ms(G) = Iw(G^). (We call pendant any edge with an endpoint of degree 1.) 

— If G® is the graph occurring from G after subdividing each of its edges, then 
es(G) =lw(G®). 

— If G" is the graph occurring from G after replacing every edge in G with two 
edges in parallel, then ns(G) = lw(G"). 

We mention that the mixed search number is equivalent to the parameter of 
proper-pathwidth defined by Takahashi, Ueno, and Kajitani in It is also 

known that the node search number is equivalent to the pathwidth, the interval 
thickness, and the vertex separation number (see 1811211811411^ 1. 

Theorem Ogives a way to transform any searching game to a conquest game 
for linear-width. Therefore, the obstruction characterization for linear-width pro- 
vided by Theorem [I] can serve as an obstruction characterization for any variant 
of the search parameters. Applying Theorem [T] and Theorem [3l we can directly 
derive the monotonicity results of ITOT6] for the three variants of the search 
game. In particular, we have the following. 

Theorem 4. Let G be a graph and k, m two positive integers. The following 
assertions are equivalent. 

1. There exists a k-expansion for E(G^) / E(G^) / E(Gp) with respect to the func- 
tion tti. 

2. There exists a monotone k- expansion for E{G'^) / E{G'^) / E{G^) with respect 
to the function ai . 

3. There exists no {k, m)- obstacle for E{G^)/ E{G^) / E{G^) with respect to the 
function ai. 

4-. There exists a node/edge/mixed search for G using k searchers. 

5. There exists a monotone node/edge/mixed search for G using k searchers. 
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5 Conclusions 

It appears that Theorem [3 gives a framework to prove the monotonicity for any 
search/conquest game based on some connectivity function a. Several interesting 
versions of the conquest game can be generated like that. In particular, we can set 
a to be any connectivity function mapping vertex sets of V (G) to non negative 
integers. As an example, we can define a : V (G) — {0, . . . , |F(G) |} such that for 
A C V(G), a(A) is the number of endpoints of the edges in {{u,u} \ v £ A,u £ 
V{G) — A}, i.e. the number of countries lying on both sides of the frontier of 
the occupied area defined by A. By observing that this version of a is indeed a 
connectivity function, we can directly apply Theorem [T] and derive a min-max 
theorem as well as the monotonicity property of the corresponding game. 

As a last application of Theorem [U we define a conquest game on a plane 
graph where now the regions represent countries and the cost of the strategy is 
the maximum number of vertices incident to its frontier. This game seems similar 
to the “dual” version of the game of Subsection 14.31 when restricted to planar 
graphs. The only difference is that now the cost is determined by the vertices of 
the frontier and not by the edges. In both cases, the cost function is a connectivity 
function and the monotonicity property is a consequence of Theorem [3 We 
conjecture that the two games are equivalent, in the sense that when they have 
the same aggressivity, the existence of an optimal complete conquest strategy 
for the one implies the existence of an optimal complete conquest strategy for 
the other. 

Another way of obtaining game-theoretical approaches to width type param- 
eters is the study of conquest games with an “average” criteria of optimality. For 
example, instead of the a- cost of a conquest strategy £ one can define the a- sum- 
cost of a conquest strategy £ as J2o<i<r Similar to LemmaHl it is possible 

to prove monotonicity results for expansion games with minimal sum-cost. This 
idea can be used to obtain the monotonicity result for the game related to the 
sum bandwidth problem [S] (or optimal linear arrangement). (See also jS] for 
similar approach to the interval completion problem.) Nevertheless, to find an 
analog of Theorem [T], if any exists, appears to be an interesting and hard open 
problem. 

In conclusion, we find interesting the question whether there are monotone 
search or conquest games where the cost is not expressed by a connectivity 
function. 
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Abstract. The input for the hotlink assignment problem consists of a 
node weighted directed acyclic graph with a designated root node r. The 
goal is to minimize the weighted shortest path length rooted at r by 
adding a restricted number of outgoing arcs (hotlinks) to each node. 
The {h, fc)-hotlink assignment problem is defined on fc-regular complete 
trees, and at most h hotlinks can be assigned to each node. We contribute 
algorithms for the (1, fc), (2, k), and (fc— 1, k) hotlink assignment problem. 



1 Introduction and Preliminaries 

Fast access to large data bases is a main challenge in information technology. 
Probably one of the largest and most distributed data bases is the world wide 
web (WWW). A natural representation of the web is a directed graph, where 
nodes represent web pages and arcs represent links from one page to another. 
Given a root page and node weights which represent the access frequency of the 
corresponding pages, one tries to minimize the expected number of steps which 
are needed to get from the root page to the information nodes. This immediately 
corresponds to the well known shortest path tree problem, which motivates the 
restriction of the input graphs to the class of rooted trees. 

The Hotlink Assignment problem is a network upgrade problem: We are 
allowed to add new arcs to the graph in order to insert shortcuts and decrease 
the expected path length. 

The problem has been introduced in pp. The authors investigate the case 
where the underlying graph is a complete binary tree and only the leaves have 
positive access weight. We generalize the results to complete fc-regular trees. 

Before defining the problem we briefly recall some definitions. A tree rooted 
at node r is a tree in a directed graph, where all nodes can be reached from the 
root node r. The distance d{v, w) between two nodes v and w is the number of 
arcs on the directed path from v to w. The out-degree (or simply degree) of a 
node is the number of arcs leaving that node. A node of degree zero is called a 
leaf. If all internal nodes have equal degree k, the tree is called k-regular. The 
distance d{r,v) from the root r is called the depth of node v. The height of 
the tree is the maximal depth. If all leaves have equal depth, the tree is called 
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complete. Let {v,s) and {s,g) be a directed path of length 2 in the tree. Then 
g is the son of s and the grandson of v, while p{g) := s is the father of g. A 
node v' ^ V is a brother of node v if p{v') = p{v). 

Definition 1 ((/i, fc)-Hotlink Assignment problem). An input instance of 
Hotlink Assignment consists of a k-regular complete tree G = (V,R) rooted 
at a node r € V where L C V is the set of leaves. Additionally there is a node 
weight function w: L ^ R()" with 

The problem allows to construct a graph G' by inserting at most h arcs per 
node V, which are starting at v and ending at a node in the subtree hanging 
from V. The goal is to find an assignment such that the expected path length 

E[G', w] := w{v) ■ dc (r, v) 

v£L 



is minimized. 

For convenience, the weight function w is extended to the total node set V 
by assigning each inner node the sum of the weight of all leaves in the subtree 
hanging from it. This can be achieved in linear time. In the sequel we use the 
vertex’ name also to denote its weight. 

Throughout the paper we denote by H the height of the tree. Observe that 
E[G,w] = T[ for the initial graph. 



Related Work 

The (1, 2)-Hotlink Assignment has been investigated in [T]. The authors con- 
sider several weight distributions on the leaves. In particular, they present an 
algorithm with expected path length 3/4 • iL -|- 1/4 for arbitrary and uniform 
distribution, and argue that this is optimal for uniform distribution. 

The (1, 2)-Hotlink Assignment has been proven to be NP-hard even if 
restricted to general acyclic graphs [1]. A slight modification of the construc- 
tion can be used to show the NP-hardness of the (/i, /c)-Hotlink Assignment 
problem for any fixed h G N. We omit the proof due to lack of space. 

There are variants of the problem where the total sum of additional hotlinks 
is bounded rather than the number of hotlinks per node. In this situation it is 
an optimal strategy to assign all hotlinks starting at the root of the tree. This 
problem variant is known as bookmark assignment (see [I] and the references 
therein) . 

The Hotlink Assignment problem is a special case of a graph augmentation 
problem. Those types of problems are specified by a graph G = {V, E) and a 
subset Eq C E which induces the initial graph G[Eq]. The goal is to find a 
minimum cost set E' d E oi edges such that the augmented graph G[Eq U A'] 
satisfies some requirements. Augmentation problems have first been considered 
in |3] where an algorithm for obtaining 2-connectivity in an unweighted graph 
has been given. A survey on connectivity augmentation problems can be found 
in [B] . Other requirements include a bound on the diameter |3] . 
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Network improvement problems are another category of problems on graphs, 
where weights and costs on edges and nodes are subject to change rather than 
the structure of the graph itself. For the case of a shortest path tree, Berman 
provided an algorithm which improves the length of a shortest path tree by edge 
weight modifications [^. 

2 (fc — 1, fc)-Hotlink Assignment 

We start with the investigation of (fc — 1, /c)-Hotlink Assignment. As pointed 
out in the introduction, an instance of this problem consists of a complete k- 
regular tree. For each vertex we are allowed to add up to fc — 1 hotlinks. 



2.1 The Algorithm 

The algorithm is a generalization of the approach presented in [T] for (1, 2)- 
Hotlink Assignment. The main idea is a recursive assignment starting from 
the root. For each node the algorithm determines a set of up to A: — 1 grandsons of 
maximal weight and assigns hotlinks pointing to them. While descending some 
care must be taken in order to avoid useless hotlinks. The reason why each 
hotlink spans exactly two levels is discussed in Section 15.11 

We now describe the algorithm. See Figure [T] for an illustration and Figure |2] 
for a detailed description. 




Fig. 1. Tree structure T (left) and S (right). 

The algorithm starts with assigning an hotlink to root r according to a T- 
structure T{1), where I is the height of the structure. It identifies a grandson g 
of r of maximal weight and adds the hotlink (r, g) to the graph. 

Let p := p{g) be the father of g. Obviously, a hotlink from p into the sub- 
tree rooted at g is useless, because for all affected paths the same gain can be 
performed by using the just added hotlink (r, g) instead. Hence the algorithm 
can ignore the subtree hanging from g in this situation. This is performed by 
assigning fc — 1 hotlinks to p according to ^'-structures S{1 — 1). All remaining 
brothers of p are treated recursively according to a T-structure T(l — 1) as just 
described. 
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T-Structure 

Input: Tree with root node r and tree height I 

1 if 1 > 2 then 

2 determine grandson g of r with maximal weight 

3 insert hotlink (r, g) 

4 call algorithm r-STRUCTURE(gi, I — 2) 

5 for each brother g' of g do 

6 call algorithm S'-STRUCTURE(p(p), g' , I — 1) 

7 end for 

8 for each brother p' of p{g) do 

9 call algorithm T-STRUCTURe(p', I — 1) 

10 end for 

11 end if 

S-STRUCTURE 

Input: Tree with root node r and son p and tree height I 

1 if 1 > 2 then 

2 determine grandson g of r with maximal weight 

3 insert hotlink (r, g) 

4 call algorithm r-STRUCTURE(gi, I — 2) 

5 for each brother g' of g do 

6 call algorithm S'-STRUCTURe(p, g' , I — 1) 

7 end for 

8 end if 



Fig. 2. Algorithm for (fc — 1, fc)-HoTLiNK Assignment. 



The assignment within an 5'-structure S{1) with root r is as follows: The al- 
gorithm identifies a grandson g of r of maximal weight and adds the hotlink (r, g) 
to the graph. Similarly to the above case, the father p{g) should be treated as a 
root of S'-structures S{1 — 1) in order to avoid useless hotlinks. 

The recursion can be terminated at height I = 1. We remark that the sug- 
gested algorithm uses the full power of assigning k — 1 hotlinks only at nodes 
rooting an S'-structure. 

2.2 Analysis of the Algorithm 

We introduce the following notation for the nodes in tree T{1): The sons of root r 
are denoted by Ui for i = 1, ... ,k. The sons of node Oi are denoted by for 
j = 1, ... ,k. Without loss of generality we can assume that the weight of on 
is a maximum of all grandsons. (Notice that from this assumption we can not 
deduce that weight w{a\) has maximal weight among the brothers of ai.) In 
tree S{1), we denote the grandsons of root r by 6i, . . . , bk, where bi is weight 
maximal. 
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Observe that if we consider average path lengths in a subtree, the weights of 
the leaves must be considered as being normalized: 

k k k k 

“b = H Qii — 1 and — 1 . (^) 

i—l j—l i=l i—1 

Denote by si and ti the expected path length in a tree S{1) and T{1), respec- 
tively. Then we have Sq = to = 0 Si = ti = 1. From the algorithm we can 
deduce the following recursion 

b = 1 -I- an • t;_2 -I- (oi - an) • s;_i -I- (1 - Oi) • t;_i 
S; = 1 -|- &i • ti-2 -|- (1 — bi) ■ Si-i 
which we will prove to have a solution of the form 



ti < a ■ I + /3 
Si < a ■ I . 



( 3 ) 



We briefly describe how to guess suitable values for a and /3. The conditions 
for Si in 0 and the fact bi > 1/k, and the assumption a > (3 yield (3 < 
{k + l)a — k. Then the condition for ti is satisfied if 



1 — a • (1 -I- On) -I- (an — ai) • /3 < 0 . 



( 4 ) 



Clearly, an — Oi < 0. On the other hand, from ([T]) and the fact that an > a^ 
we can conclude that an — ai < — fc -I- l)an — 1 =: d • an — 1. The latter 

estimation is sharper for an < 1 /d. 

Thus, with an > 1/d and an — ai < 0, equation dU is satisfied if 



a > 



1 - fc -k 1 

1 -k 1 /d “ k^ -k + 2 



and 



P < {k + 1) ■ a — k = 



k^-k + 2' 



If we use equality as a definition for a and /?, one can easily verify that © holds 
also for the remaining case an < 1 /d. 

Lemma 2. Let a := 0 ''nd P := • Then reeursion (0 yields 



ti < a ■ I + P and si < a ■ I . 



Proof. The proof uses induction on 1. As observed above, we have Sq = to = 0 
and Si = ti = 1 which show the claim for I — 0 and 1 = 1. By inspection, one 
observes that t2 = 2 — an < 2 — 1/fc^ and S2 = 2 — 6 i <2 — 1 /fc which both 
satisfy the claim for I = 2. 

Consider 1 = 3. Observation to < 3 is sufficient for the claim if fc > 3. 
Otherwise, if fc = 2, we have < 5/2 = 3/4 • 3 -I- 1/4. For S 3 we observe 
S 3 < (3fc^ — fc — (fc — 1 )) /k'^, and it is easy to verify that this satisfies the claim 
for t = 3. Notice that these bounds on si and ti for I < 3 are actually tight for 
the case of uniform distribution. 

Thus the induction hypothesis is shown for I < 3. The induction step is valid 
by the construction of a and P as described above. □ 
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Theorem 3. There is an algorithm for problem {k — 1,/c)-Hotlink Assign- 
ment with expected path length bounded by 

— k + 1 {k — 1)^ 

— k + 2 kP' — k + 1 

where H is the height of the input tree. The running time is linear in the number 
of nodes. □ 

As noted before, the algorithm for the {k — l,fc)-HoTLiNK Assignment 
problem does not use the full power of assigning k — 1 hotlinks to all nodes: the 
nodes which are a root of a T-structure have only one outgoing hotlink. 



Second phase 

Input: Tree structure from first phase 

1 Let r be the current root 

2 if r is the root of a T-structure then 

3 Let h G {0, 1} be the number of hotlinks currently emanating from r 

{ We can assign k — 1 — h more links } 

4 Let flu be the target of the hotlink which emanated from r at end of first phase, 

ai := p(aii) 

5 Choose k—l — h nodes {oi} from brothers of oi such that the targets of hotlinks 
emanating from those nodes are weight maximal 

6 Replace each of those hotlinks (oi , x) by (r, x) 

7 end if 

8 Descend recursively to all sons of r 



Fig. 3. Second phase of algorithm for {k — 1, fc)-HoTLiNK Assignment. 



We claim that the results for (fc — 1, /c)-Hotlink Assignment can be im- 
proved for A: > 3 by applying a second algorithm afterwards (cf. Figure |3]), 
which uses the following idea: Descending recursively in the tree, each time 
when a node r is found which has not already k — 1 outgoing hotlinks, then iden- 
tify sons s of r where a hotlink (s, x) emanates from, and replace that hotlink 
by (r, a;). If one ensures that during this phase no “overlapping” hotlinks are 
produced, then there is a nonnegative net improvement of the weighted path 
length. For the case of uniform probability distribution one can show by an 
analysis similar to the above that the upper bound on the path length can be 
decreased to 



+ f -iZ + Od) 



2(fc- l)fc2 



by the modification. 



3 (2, fc)-Hotlink Assignment 

We now turn over to problem (2, fc)-HOTLiNK Assignment. An instance of this 
problem consists of a complete /c-regular tree, where at most 2 hotlinks per node 
can be assigned. 
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The idea of the algorithm is similar to the previous. We refer to Figure |4]for 
an illustration and to Figure [5] for a detailed description of the algorithm. For 
each node, we identify two grandsons gi, g2 with maximal weight and assign two 
hotlinks to them. 





Fig. 4. Tree structure Tm- 



Tm-STRUCTURE 

Input: Tree with root node r of degree m and tree height I 

1 if 1 > 2 and m > 0 then 

2 determine two grandsons gi ,Q 2 of r with maximal weight 

3 insert hotlinks {r,gi) and (r, 32 ) 

4 call T/;-STRUCTURE(gi, I — 2) for both gi and 32 

5 if p{gi) = p{g2) then 

6 call Tfe_ 2 -STRUCTURE(p(gi), I — 1) 

7 else 

8 call Tfe_i-STRUCTURE(p(gi), I — 1) for both gi and g2 

9 end if 

10 for all remaining sons w of r do 

11 call algorithm Tfe-STRUCTURE(u, I — 1) 

12 end for 

13 end if 



Fig. 5. Algorithm for (2, A:)-Hotlink Assignment. 

Let Tm be a tree structure which describes a tree whose root is of degree m. 
To avoid useless hotlinks, we proceed analogously to the strategy given in the 
previous section: Let pi := p{gi) and P2 ■= p{g2)- If Pi = P2, then we assign the 
father p\ a tree structure Tk-2- All brothers of pi are assigned a tree structure T^. 
If Pi yf p2, then both fathers are assigned a tree structure Tk-i- Again, the 
remaining brothers are assigned a r^-structure. Notice that in contrast to the 
strategy given in the previous section, the current strategy assigns two hotlinks 
to each node (except the lower levels). 
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Denote by the expected path length in a tree Consider the two 

hotlinks assigned to the root. If their targets share the same father, we have 



c = 1 + a • + {b- + (1 - b)tf_, 

k 

where a := an + Oi 2 and b ay 

Otherwise, if their targets have different fathers, then 
= 1 + a • tf_2 + (6 — + (1 — 

k 

where a := an + 021 and b := + 02i) 



( 5 ) 



( 6 ) 



Again we conjecture < a - 1 + p. To determine a and (3 we insert both dS|) 
and dSl) into the conjecture and get an overall estimation 

tj" < a/ + /3 + (1 — (a + 1 ) 0 ) . 



To satisfy the conjecture, the rightmost term must vanish, i. e., bounded from 
above by zero. This happens if, using a > 2/mk > 2/fc^, we choose a > 

This holds for arbitrary /3. 



Lemma 4. For arbitrary I G N and m G {k — 2, k — 1, k} we have 

r.2 p 

< . / _| 

' - fc2 + 2 ^ 2 ■ 



Proof. The proof is by induction on 1. For I = 2 one observes that 

t^ = 2-a<2-^<2-^. 
mk 



The right hand side is bounded by a • 2 + /3 for arbitrary (3 > ■ Notice 

that f3 has not been determined yet. For convenience, we choose the slightly 
weaker bound f3 := ^ 2^2 point. 

Consider 1 = 3. We propose that the weights are uniformly distributed. By 
observation, for fc > 3 we have 



t-<3- 



4fc2 -4 



k* 



< 3- 



fc2 + 2 



< 






fc2 + 2 



2 

yfc2 + 2 ■ 



For k = 2 the above estimation is valid only if the hotlinks point to different 
subtrees. If both hotlinks point into the same subtree, we must use the weaker 
estimation < 9/4 which follows from the observation that the gain of the 
hotlinks sums up to at least 3/4. 

This shows the induction hypothesis for I = 2 and 1 = 3. From the construc- 
tion of a as described above it follows that the induction step is also valid. □ 
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Theorem 5. There is an algorithm for problem (2, fc)-HOTLiNK Assignment 
with expected path length bounded by 

„ I 2 

fc2 + 2 + 2 

where H is the height of the input tree. The running time is linear in the number 
of nodes. □ 

4 (1, fc)-Hotlink Assignment 

The algorithm from Section[3]for the (2, fc)-HOTLiNK Assignment problem can 
be slightly simplified to solve the (1, /c)-Hotlink Assignment problem. Since 
there is only one hotlink emanating from each node, the distinction of cases (see 
Lines 5 to 9 in Figure O is no longer necessary. As a consequence, one can show 
the following result: 

Theorem 6. There is an algorithm for problem (1, fc)-HOTLiNK Assignment 
with expected path length bounded by 

k^ jj I 1 

fc2 + 1 fc2 + 1 

where H is the height of the input tree. The running time is linear in the number 
of nodes. □ 



5 Conclusions 

5.1 Optimal Hotlink Length 

The two algorithms both use hotlinks which span two levels. We follow |T] to 
show that this is the optimal rule as long as the weight distribution is uniform. 
Consider an arbitrary tree with root r. If there is a hotlink added to the root 
which spans d levels, this hotlink reduces the path length by d — 1 for one node 
of weight The same reduction applies for each additional hotlink (as long 
as the number of hotlinks does not exceed the number of grandsons) since the 
paths are independent. 

Consequently the total gain of h hotlinks where each hotlink spans d levels 
is bounded from above by h • (d — 1) • k~'^. This factor is maximal for 



This matches the result from |T] which states that the maximal gain is attained 
when d = 2 or d = 3 for the case k = 2. For fc > 3, we have 1 < d < 2 and can 
conclude that a hotlink spanning two levels is the optimal choice. 



198 Sven Fuhrmann, Sven Oliver Krumke, and Hans-Christoph Wirth 



5.2 Lower Bounds for Uniform Distribution 

We now generalize the results from [J to prove a lower bound on the quality of 
any (h, fc)-HOTLiNK Assignment strategy. We assume that the weight distri- 
bution is uniform. Consider a subtree of weight w{r) hanging from a root r. As 
argumented above, the maximal gain of any assigned hotlink is attained when 
the hotlink spans two levels. As long as h does not exceed the number of avail- 
able grandsons, the same arguments hold if h hotlinks are assigned to node r. 
Thus, the maximal gain which can be achieved by assigning hotlinks to node r 
is 



In order to derive an upper bound on the total gain in the tree, it suffices to 
sum the gain over all nodes. Observe that at level i we have fc* nodes each of 
weight l//c*. Notice that nodes from the two lowest levels are not counted since 
any hotlink assigned to them does not improve the path length. Hence the total 
gain is bounded from above by 



1 u H -I 



Recall that the weighted path length of the tree without hotlinks equals H. Thus 
we can conclude the following result: 

Corollary 7. There is no strategy for (h, / c)-Hotlink Assignment with ex- 
peeted path length lower than 



H -h- 



H - 1 



— h h 



for each instance with uniform distribution. 



□ 



From the above observations we can derive an algorithm which solves for 
any h < k — 1 the {h, fc)-HOTLiNK Assignment problem on uniform distribu- 
tion to optimality (i.e., the solution attains the lower bound). The algorithm 
starts with the root and descends recursively. For each node r, it indentifies the 
h leftmost sons {si, . . . , s^} and assigns for each Si a hotlink from r to the right- 
most son of Si- Since h < k — 1, this construction guarantees that all hotlinks 
are independent and contribute the maximal gain. 



5.3 Summary of Results 

We have presented algorithms for the (1, fc), the (2, k) and the {k—1, fc)-HOTLiNK 
Assignment problem, respectively. For all algorithms we provided upper bounds 
on the resulting path length. Moreover, we gave lower bounds (valid for uniform 
distribution only) which adjust the quality of the strategies. A summary can 
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Table 1. Results presented in this paper. 



Problem 

(l,fc)-HA 

(2,fe)-HA 
(fc- l,fe)-HA 



upper bound 

(t _ 1 1 fr I 

k'-‘-k+2) ^ k'2-k+2 



lower bound (uniform dist.) 



1 








k-l\ 

) 


■H+^ 



be found in Table [TJ It is not surprising that the factor of the linear term ap- 
proaches 1 as fc — >■ 00 since the ratio of leaves which profit from the hotlinks 
tends to 0 in this situation. 

Table |2]compares the results of the several strategies presented in this paper. 
Observe that, for k = 2, the known problem (1, 2)-Hotlink Assignment is a 
special case of {k — 1, fc)-HoTLiNK Assignment, and that our analysis of the 
(fc — 1, fc)-strategy matches the known results from pp in that case. Furthermore, 
for fc = 3, the strategies {k — l,k) and (2, fc) both operate on 3-regular trees 
allowing two hotlinks per node. Here one can observe that the (2, fc)-strategy 
performs better which may be a consequence of the fact that the other strategy 
does not always assign all possible hotlinks. 

Table 2. Comparison of the strategies. The linear factor a of the terms aH -|- /3 is 
given also in decimal notation. 



(l,fc)-HA (fc-l,fe)-HA (2,fc)-HA 





bound 


a 


bound 


a 


bound 


a 


upper bound 

k = 2 


iH+i 


0.800 


iH+k 


0.750 


IH+I 


0.667 


CO 

II 


9 TT 1 1 

10-“ 10 


0.900 


iH+k 


0.875 




0.818 


lower bound 

k = 2 




0.750 


!^+i 


0.750 




0.500 


k = 3 


IH+I 


0.889 


IH+I 


0.778 


IH+I 


0.778 
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Abstract. A subset of nodes S' in a graph G is called k-dominating if, 
for every node u of the graph, the distance from u to S is at most k. We 
consider the parameter ')k(G) dehned as the cardinality of the smallest 
fe-dominating set of G. For planar graphs, we show that for every e > 0 
and for every k ^ (5/7 + e)D, ')k{G) = 0(l/e). For several subclasses 
of planar graphs of diameter D, we show that 'yk{G) is bounded by a 
constant for k D/2. We conjecture that the same result holds for 
every planar graph. This problem is motivated by the design of routing 
schemes with compact data structures. 



1 Introduction 

Given a graph G = (V, E), denote by (1g{u, v) the distance in G from u to v, and 
by Ek{u) = {z) G E I da{u,v) < k} the ball of radius k centered at u (note that 
Tfe (u) is also defined for non integral k) . We extend this notation to every subset 
S C V, letting Ek{S) = Uugs ^k{u). We say that a subset of nodes S' C ]/ is a 
k-dominating set for G (or, S fc-dominates G, or S covers G) if /fe(S) = V. Let 
"fk{G) denote the cardinality of the smallest fc-dominating set of G. Our goal is 
to bound 'yk(G) for various planar graphs G and values of k. 

To illustrate the properties of the parameter jkiG), let us consider a tree T 
of diameter D. It is clear that if fc ^ D/2, then jk{T) = 1; it suffices to consider 
the center of T. On the other hand, if fc < D/2 then there exists some n-node 
tree Tq of diameter D for which 'yk{To) ^ 2(n — 1)/D. For instance, consider 
To composed of a star with each edge subdivided into q — 1 nodes, where 
n = pq 1, D = 2q and 7 q_i(To) = p. Every two leaves of this tree are at 
distance 2q, thus p nodes of Tq are required to {q — l)-dominate all the leaves. 

More generally, for k ^ D/2, every n-node graph G = {V,E) of diameter D 
satisfies 7 fc(G) < ^Jn{l -b In n). To see this, we use a dual characterization of a k- 
dominating set as a set S which hits the collection {Ek{u) \ u G V}, i.e., such that 
S n Ek{u) ^ 0 for every u gV. Now, note that for k ^ D/2, Ek{x) fl A(y) ^ 0 
for any two nodes x and y. Thus every set Ek{x) fc-dominates G. Hence, either 
there exists a node x such that |Tfc(x)| < a, where a = \/n{l Inn), and we 
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are done, or |/fc(a;)| ^ a for every node x. In the latter case, the claim follows 
by a result of Lovasz m about cover sets, which states that there exists a 
^-dominating set of G of size /3 < n(l -|- lnn)/mina; |/fe(cc)| ^ \/n{l + Inn). 

For planar graphs, it is known that the size of a dominating set (i.e., 1- 
dominating set) is bounded by 3 if Z? = 2, and by 10 if = 3, cf. |T^. Thus, 
for k ^ D/2, and every planar graph G, ^k{G) < 3 if = 2, and 7 fc(G) < 10 if 
Z? = 3. 

A recent result concerns planar triangulations. A triangulation of the plane is 
a planar graph with an embedding on the plane such that each face, except maybe 
the outer face, is a triangle. In it is shown that every planar triangulation G 
in which every internal node (i.e., that does not belong to the outer face) has 
degree at least 6, satisfies 7 fc(G) ^ 2, again for D/2. 

Section Elpresents our main results. For planar graphs, we show that for every 
e > 0 and for every integer k ^ (5/7 -I- e)D, 7 fe(G) = 0(l/e). For outerplanar 
graphs of diameter D, we show that jk{G) ^ 2 for A: ^ D/2. We conjecture 
that ^k{G) is bounded by a constant for every planar graph of diameter D for 
k ^ D/2. 

In Section [3] we show study bounded treewidth and chordality graphs. More 
precisely, we show that if G has treewidth bounded by t, then jk{G) ^ t -I- 1 for 
k ^ [ZZ/2J , and 7 fc(G) = 1 for fc ^ L^/2J + [c/2j if G has no chordless cycle of 
length greater than c. 

The motivation for studying this parameter stems from the design of routing 
schemes with compact data structures. In particular, it is shown in Section |T] 
that for every graph G of diameter D and every fc ^ 0, G has an interval 
routing scheme nUTTI with dilation at most D + k and compactness at most 
\lk{G) + 1. (The dilation of a routing scheme is the length of the longest route, 
and its compactness is the maximum number of intervals over any arc.) Moreover, 
the upper bound on the compactness can be reduced to \'yk{G) + o(jk(G)), 
if jk(G)/ logn — > -|-oo. This result improves (by a multiplicative factor) and 
generalizes the compactness vs. dilation trade-offs of [10] and of [1^. Our result 
also implies that for every constant e > 0, every planar graph of diameter D has 
an interval routing scheme with dilation at most [(12/7 H- e)ZZ] and constant 
compactness. 

2 Plane Graphs 

2.1 Preliminaries 

Hereafter, we assume that G = (V, E) is connected and of diameter D. The 
following basic property is important for the remainder of the paper. 

Proposition 1. Let S C V, let k ^ [Z1/2J, and let u,v ^ Tfc(S'). Then no 
shortest path between u and v goes through any node of S. 

Proof. Assume, towards contradiction, that there is some shortest path from u 
to V that contains a node w € S. Then da{u,v) = dc{u,w) + dG{w,v). Since 
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u,v ^ /fc(S'), necessarily u,v ^ Fkiw). Thus dc(u,w) > k, and more precisely, 
da{u, w) ^ L^J + 1 since the distance is integral. Similarly doiv, w) ^ [fcj + 1. It 
implies that dciujv) ^ 2 [fcj + 2 ^ 2 [U/2J + 2 ^ Z? + 1, in contradiction with 
the fact that D is the diameter of G. □ 

A subset S' C F(G) is a separator of G if G \ S is composed of two or more 
connected components. 

Proposition [T] yields the following immediate corollary, keeping in mind that 
every path joining two nodes of different connected components in G \ S, for 
some separator S, has to go through S. 

Corollary 1. Let k ^ \_D/2\, let S be a separator of G, and let U = P\/fe(S). 
Then there exists a unique connected component C of G\S such that U C V{G). 



2.2 Outerplanar Graphs 

Let us start by proving that for outerplanar graphs, ^k{G) is bounded by a 
constant for k ^ D 12. Recall that a graph H is a minor of a graph G ii H can 
be obtained from G by a sequence of zero or more node deletions, edge deletions 
or edge contractions. We say that G is H-free if it does not contain H as a minor. 
Let us denote by Kp q the complete bipartite graph with p nodes in one partition 
and q nodes in the other. We prove the following. 

Theorem 1. If G is iL 2 ,t+i-/cee then 'fk{G) ^ t for every k ^ D/2, and this is 
tight for a subdivision of Kt^t- 

Proof. Let u and v be any two nodes in G with dG(u,v) = D. The level of 
the node x is defined as its distance from u, denoted l{x) = dG{u,x). Let Vi = 
{x &V \ l(x) = i} for every Q ^ i V, D. Let M be the set of nodes x £ Vk such 
that there exists a path (x, Xi, X 2 , . . . ,xi,v) from a; to w satisfying l{xi) > k for 
every 1 ^ i ^ 

We first claim that the set M has cardinality at most t. To prove this, we 
show that if M contains t+1 distinct nodes then G contains K 2 ,t+i as a minor, 
a contradiction. Indeed, consider the graph H composed of the t+1 nodes of M 
and the 2(t + I) paths linking M to u and v, and delete the edges and nodes of 
G\ H. Then it is easy to see that by contractions we obtain a copy of Al 2 .i+i- 

Now let P = (u. Ml, U 2 , . • . , Up, v) be any path linking u and v and let q be 
the largest subscript such that l{uq) = k. Clearly, l{ui) > k for every i in the 
range q < i ^ p, and thus Uq £ M . Hence every path linking u and v must go 
through the set M which, therefore, is a separator in G. (In particular, u and v 
belong to different connected components of G \ M). 

We now prove (by contradiction again) that M fc-dominates G. Suppose 
that there exists a node x in G such that da{x,M) > k. As observed above, 
M separates u from v, hence either u or v does not belong to the connected 
component of x in G \ M. In the former case we get 



dG{u,x) ^ dciu, M) + dcix, M) > 2k ^ D, 



204 Cyril Gavoille et al. 



and in the latter case 

da{v,x) ^ cIg{v, M) + da{x, M) > D — k + k = D. 

Therefore, either dciujx) or da{v,x) is strictly greater than D, a contradiction. 

We thus get that M fc-dominates G, hence jk{G) < \M\ ^ t, completing the 
proof. □ 

As every outerplanar graph is K 2 ^ 3 ~bee, we get the following. 

Corollary 2. If G is an outerplanar graph of diameter D then ^k{G) ^ 2, for 
every k ^ D/2. 

2.3 Planar Graphs 

We assume from now on that G is a plane graph, that is, a planar graph with an 
embedding in R^. More precisely, the nodes are points of and the edges are 
simple curves that can cross or meet each other on the nodes only. A connected 
subset of is called a region of the plane. Given a subgraph H oi G and a 
point w ^ H (i.e., a point w G that does not belong to an edge or a node 
of H), we denote by reg(w, H) the unique region of \ i? containing w. Note 
that if i? is a tree, then R^ \ consists of one region only. If R^ \ i? is composed 
of exactly two non-empty regions, we denote by feg(w, H) the complementing 
region, such that R^ = reg(ic, H) U feg(w, H) U H. 

Given an integer "0 ^ 3, a tjj-gon w.r.t. G and fc is a subgraph H of G defined 
by a sequence (xi,...,a;^) of distinct nodes, and a sequence (Qi, ■ ■ ■ ,Q^) of 
paths such that the following four conditions hold: 

1. Qi is a shortest path from Xi to mod V')-i-i> every i G {!,..., if}-, 

2. R^ \ i? is composed of at most two non-empty regions; 

3. x^ ^ A(QiU---ug^_2); 

4. d{x^, X(^, mod v-)+i) > k, for every i G {1, . . . 

The point x^jj is referred to as the extremity of the "0-gon. A ip-gon H is called 
weak if conditions 3 and 4 do not necessarily hold. A weak 3-gon is hereafter 
called a triangle. 

Roughly speaking, a V'-gon consists of if shortest paths, each of length at 
least fc-l- 1, possibly sharing some edges. Fig. [T] presents a 4-gon H w.r.t. a plane 
graph G of diameter D = A and k = 2. Note that the xfs may border to different 
regions induced by H . 

Given a path P and two nodes u and v on P, we denote by P[u, u] the 
subpath of P between u and v. 

Let Qi, ... ,Qp be a set of shortest paths, and for each i G {1, . . . ,p} let 
{z\, . . . , z\.) be the sequence of nodes of P fl Qi encountered in this order along 
a walk on P from u to v (possibly = 0 if P fl = 0). The simplified path of 
P on QiU ■■■ U Qp is the path defined by 

S = P[u, zl] U Qi[zl,zlJ U P[zl^,zj] U • • • U Q,[z\,zl/\ U P[z*., z*+^] U--- 
UQp[zlzl]UP[zlM . 
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X. 




Fig. 1. A plane graph G with a 4-gon H for A: = 2 (bold edges). 



Roughly speaking, S uses shortcuts of P on Qi, then shortcuts of P on Q 2 , and 
so on. Note that if P is a shortest path, then S is also a shortest path between 
u and V. (If U = Q for all i, then S = P.) Fig. [5] shows the simplified path S 
of P on Qi U Q 2 - Note that S is unique, but other shortcuts are possible, for 
instance, Q\[u, z\] U P[z\,z{] U Q 2 [z\tV\. 



For notational convenience, we use Pk{H) where i? is a graph instead of 



The next lemma shows an important property of t/j-gons. The proofs of the 
next two lemmas can be founded in [S]. 

Lemma 1. Let PI he a p-gon of G with p ^ 4 and let k ^ \_D/2\. Then for 
every w ^ Pk{H): 

1. there exists a shortest path A from w to Xp whieh is entirely eontained in 
reg{w, H) U H; 

2. there exists a shortest path B from w to Xi, i < p, whieh is entirely eontained 
in reg(m, H) U H. 



S 




•2 



Fig. 2. The simplified path S' of P on Qi U Q 2 . 



rk{v{H)). 
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Moreover, A and B can be chosen such that the two graphs Hi = B U (Qi U 
Q,+i U • • • U Qp-i) U A and H2 = B U {Qi_i U ft_2 U • • • U Qi) U Qp U A are, 
respectively, a {p — i + 2)-gon and an {i + 2)-gon of G. 

Lemma 2. Let Q be a path of q nodes. For every integer X 0, 7a(Q) = 
[9/(2A + 1)] . 

2.4 Main Result 

In this section we prove our main result, namely, that 7fc(G) < 3/e + 6 for every 
planar graph G and (5/7 + e)D (for any e > 0). 

Towards proving this result, we introduce and study the related notion of 
a k-dominating subgraph. Given a graph G, a connected subgraph H is called 
a fc-dominating subgraph of G if V{H) is a fc-dominating set for G. Our main 
result is derived from the following technical theorem concerning ^-dominating 
subgraphs, which may be of independent interest. 

Theorem 2. For every planar graph G of diameter D, and for every k ^ 
[5Z5/7J, there exists a k-dominating subgraph H of G such that H is composed 
of at most 6 shortest paths of G, and \ V{H)\ ^ 6D — 1. 

Proof. We present a constructive proof, implying a polynomial time algorithm 
for the construction of such a /c-dominating subgraph H . Without loss of gener- 
ality we assume hereafter that k < D, as the result clearly holds for k = D. The 
algorithm constructs H gradually. Letting 5{H) denote the deviation of a candi- 
date subgraph H from being a /c-dominating subgraph, i.e., 6{H) = |P\T/(i7)|, 
we attempt to progress by repeatedly decreasing S{H). 

Step 1: We start assuming that k ^ \D/2\. Let x\ and X2 be two nodes such 
that dG{x\,X2) > k, and let Qi be any shortest path connecting them. 

If V (Qi) is fc-dominating, then taking H = Qi we are done, as H is connected 
and \V{H) \ < D. 

Step 2: We construct a 3-gon H for G as follows. Let 0:3 ^ Fk{Qi), and let Q2 
(resp., Q3) be a shortest path between x^ and X\ (resp., between x^ and X2), 
chosen such that H = Qi U Q2 U Q3 induces at most two regions of R.^. This 
can be done by ensuring that for every t G <52 Cl Q3, the subpaths of Q2 and Q3 
from X3 to t coincide, i.e., Q2[x3,t] = Qs[x3,t], and analogously, that for every 
t€Q2CiQi, Q2[a;i,t] = Qi[a;i,t], and that for every t e Q3 n Qi, Q3[x2,t] = 
Qi[x2,i\. One can readily verify that iJ is a 3-gon of G. 

Step 3: is a 3-gon. 

If V (H) is a fc-dominating set then we are done, as is connected and \ V{H) \ ^ 
3D. 

Otherwise, let w ^ Fj;{H). By Lemma [H G has a (3 — i -I- 2)-gon and a 
{i 2)-gon. Since 1 ^ i < 3, G has a 4-gon. Set H to be this 4-gon. 

Step 4: is a 4-gon. 

liV{H) is a fc-dominating set, then we are done, as H is connected and \ V{H)\ ^ 
AD. 
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Otherwise, let m = 6{H) > 0, and let w ^ rk{H). Assume that H is defined 
by the nodes {x \, . . . , x^) and the shortest paths (Qi, . . . , <54 )- By Lemma[l] let A 
be the shortest path between w and X4 that is wholly contained in reg(?n, H)UH, 
and let B be the second shortest path between w and Xi, i <4, that is wholly 
contained in reg{w,H) U H. Let Hi = B U {Qi U Qi+i U • • • U Q3) U A and 
H2 = BU (Qi-i U • • • UQi) UQ 4 U A be the respective (6 — i)-gon and (i + 2)-gon 
whose existence is asserted by Lemma [TJ Note that Hi and H2 are 3-, 4- or 
5-gons. Figure depicts Hi when i = 2, in which case Hi = B U Q2U Q^U A, 
namely, it is obtained by following the cycle w, X2,Xs,X4, w. 

Let H' = Hi U H2 = AU H U B. If V{H') is a fc-dominating set, then we 
are done, as H' is connected and \V{H')\ ^ 6D — 1. Indeed, in the worst-case 
H' is composed of 6 paths of length D that meet in 5 nodes (xi, . . . , X4 and w), 
thus has 5 -I- 6(1? — 1) = 6Z? — 1 nodes. So hereafter assume that V{H') is not 
^-dominating. Let m' = 5{H') > 0, and let w' ^ Bk^H'). Note that m' ^ m — 1 
because V{H') covers Fk{H) and also w (as V{H) C V{H')). 

H' defines at most three regions, namely, feg(?c, H), and reg(w, H), which in 
turn is split by Hi into two sub-regions, Ri = feg(z, iJi) and R2 = reg{w,H) \ 
{Ri U Hi), where z is an arbitrary node of V{Hi) \ V{H2) (such a node must 
exist). See Fig. |3] Note that on Fig. O two incident paths may partially overlap: 
for instance A[i(;,t] = B[w,t] for some t is possible. The proof anyway is not 
based on this particular example. 




Fig. 3. The '0-gon Hi (in bold edges). A node w' ^ rk(H') has to belong to Ri or i? 2 . 



Corollary m implies that w' ^ feg(w,iL), since w and w' must belong to the 
same region. So assume that w' G Rj, for j = 1 or 2. Note that Rj = ieg{w', Hj). 
We now split the discussion into two cases. 

Case 1: Hj is a 3-gon or a 4-gon. 

In this case, we prove that Rk{Hj) = rk(H'). Clearly, rk{Hj) C Bk^H'). The 
converse is proved by contradiction. By Corollary (H Hj covers all the points of 
f^{w' , Hj), and thus H' covers those points as well since Hj C H' . Assume, 
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to the contrary, that there is some w” G Rj such that w” G rk{H') but w" ^ 
rk{Hj). This implies that there exists some u G V{H') \ V{Hj) such that w" G 
Ifc (u) . Any shortest path from u to w” must intersect Hj at some node v because 
Hj is a separator. Thus w" G Rk{v), proving that w" G rk(Hj), contradiction. 
Thus Fk^Hj) A Fk{H') as well. 

It follows that the number of nodes uncovered by F[j is m' ^ m— 1. Depending 
on whether F[j is a 3-gon or a 4-gon, we now set H ^ Hj and go to Step 3 or 
Step 4 and reiterate, and by induction the number of uncovered nodes S{H) will 
decrease to 0. 

Case 2: Hj is a 5-gon. 

Note that in this case the shortest path B is between w and X\ or X3, w.l.o.g. 
x\. (The other case is handled symmetrically by exchanging node names x\ and 
a^s-) 

Assume there is a path B' between w and X2 that is wholly contained in 
reg(w, H) U H . Then, we can remove the path B constructed while applying 
Lemma [1] and rename B' into B. The V'-gons Hi and H2 resulting from con- 
sidering the new simplified paths A and B are necessarily 4-gons, and we can 
conclude by Case 1. 

Similarly, if there is a path A' between w and X3 that is wholly contained in 
reg{w, H) U H , then we can remove path A and rename A' into A. The 4-gons 
resulting from considering the new simplified paths A and B allow us to go again 
to Case 1. 

We are thus left with the situation when none of the shortest paths from w 
to X2 and from w to X3 is wholly contained in reg(w, H) U H . 

Let A be a shortest path from w to X2, and let T be a shortest path from w 
to X3. (Without loss of generality we assume that X and Y are simplified on H). 
By assumption, X and Y are not wholly contained in reg(w, H) U H . Therefore, 
X must intersect Q3 or Q4, and Y must intersect Q\ or Q4. Let u be the farthest 
node from w such that u G A fl (Q3 U Q4), and let v be the farthest node from 
w such that v €Y D {Qi U Q4). 

Case 2.1: u G Q3. 

Similarly to the proof of Lemma [TJ A and Y intersect at some point a G 
feg(r(;, H). (See Fig. E]) Then A[u>, u] is wholly contained in reg(w, H)U H and 
intersects Q3 leading to X3. As both A and Y induce shortest paths from w to 
a, |A[?ii,u]| -I- |A[w,Q!]| = |F[?ii,a]|, or |A[?ii,u]| = |y[w,a]| — |A[u,a]|. Also, 
IQalujXsJI < |A[it,a]| -I- |A[a,X3]|. Hence |A[w,u]| -I- |Q3 [m, 3:3]| |F[w,a]| -I- 

|y[of,X3]| = |y[w,X3]|, so A[zz;,tt] U QslujX^] is a shortest path from w to X3 
wholly contained in reg(?n, H) U H, contradicting the definition of Y. Thus Case 
2.1 is not possible. 

Case 2 . 2 : v G Qi- 

This case is symmetric to the previous one; the path Y[w,v] U Qi[v,X2] is a 
shortest path wholly contained in reg(r<;, H) U H, contradicting the definition of 

A. 

Case 2.3: u,v G Q 4 . 

In this case we are left with a configuration similar to the one depicted on Fig.O 
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Fig. 5. The final configuration. 



We now assume that k ^ [5D/7J. Let a = dc(xi,u), b = dc{v,X 4 ) and 
c = da{u,v). Without loss of generality we assume that a ^ b. Note that since 
a + c + b ^ D, we have a ^ {D — c)l2. Note also that we have daiu^x^) ^ 
D — \ k\ — 1 and dcivjXs) < D — [fcj — 1 because w ^ A(u) and w ^ r'fe(u). 

If c ^ D/7, then 

dG{xi,X 2 ) ^ a + dG{u,X 2 ) < -{D — c) + D — [k\ — 1 
1 / 1 \ 5 5 

It follows that dG{xi,X 2 ) < 5D/7 — 1 < fc: a contradiction since i7 is a 4-gon, 
forcing to have dG{xi,X 2 ) > k. 

If c < D/7, then 

dG{x 2 ,X 3 ) ^ dG{x 2 ,u) + c + dG{v,X 3 ) ^ 2D -2[k\ -2+^D 
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^2D-2 -2+l^D = ^D-2 . (2) 

It follows that da{x 2 ,xz) ^ 5H/7 — 2<k, leading to contradiction since 77 is a 
4-gon, forcing to have d,G{x2,x^) > k. 

Hence the Case 2.3 is impossible as well. The theorem follows. □ 

We are now ready to state and prove our main result. 



Theorem 3. For every planar graph G of diameter D, for every e > 0, and for 
every integer k ^ (5/7 + e)D, we have ^k{G) < 3/e + 6. 



Proof. The connected subgraph H constructed in Theorem |2] consists of the 
union of at most 6 shortest paths, each of length at most D, and is a 7-dominating 
set for G for k ^ [5Z7/7J . Choosing a minimum cardinality A-dominating set for 
each path, we obtain a (fc + A)-dominating set B for G. By Lemma (2] \B\ ^ 
6\D/{2X+l)l 

Let us set A = [ eTA] . B is therefore a fc'-dominating set for G for every integer 
k' ^ [bD/7\ + [eTA] ^ (5/7 + e)7A. We have, for e > 0, 



\B\ ^ 6 



D 



2A+ 1 



< 



67A 



2[e7A] +1 



+ 6 < 



67A 



2e7A + 1 



3 ^ 

< h 6, 

e 



completing the proof of Theorem [3l 



□ 



2.5 Conjecture and Worst-Case 

We leave open the problem of bounding 7 fc(G) for k ^ 7A/2, and make the 
following conjecture. 

Conjecture 1. There exists a constant Cq such that for every planar graph G of 
diameter 7A, and for every k ^ D/2, ^k{G) < cq. 

Here we show that cq ^ 4. 

Theorem 4. For every even diameter 7A ^ 8, there exists a planar graph G of 
diameter D for which 7_d/2(G) = 4 . 

Proof. Let D = 2t for t ^ 4. The graph denoted by Gt is composed of 

(1) two nodes A and B called poles; 

(2) D disjoint paths Pi, , Pd called meridians, each of length D, joining A 
to B; 

(3) a cycle of length 2D, called equator, joining the middle nodes of each meridi- 
ans and with exactly one node of degree two between two consecutive merid- 
ians. 

(See Fig. E]for t = 4). 

Observe that for a cycle C 2 D of length 2D, Jd/ 2 {C 2 d) = 2. Hence taking 
S = {A,B}, it is clear that all the nodes of the meridians are covered, but 
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Fig. 6. The planar graph embedded on a sphere for D — 8 such that 74(G) = 4. 



the degree 2 nodes of the equator are note covered. Thus to complete S into 
a dominating set, we have to add two opposite nodes u, v on the equator. The 
resulting set S = {A, B, u, u} is then a Z)/2-dominating set of Gt- It follows that 
lD/2{Gt) ^ 4. 

Now if we take two opposite nodes different from A and B belonging to two 
distinct meridians we will not cover all the nodes of the 2t meridians. In this case 
we will need more than 4 nodes {t ^ 4) to cover all the nodes of the meridians. 
This completes the proof of the lower bound. □ 

3 Treewidth and Chordality Bounded Graphs 

In this section we establish some additional results concerning the value of 
7fc(G) for various graph classes. We use the notion of tree-decomposition in- 
troduced by Robertson and Seymour in their work on graphs minors |13| . A 
tree- decomposition of a graph G = {V,E) is a tree T = (V,E) whose nodes are 
subsets of V, such that 

Uxey X = V- 

2. for every edge {u, v) S E, there exists a node set df € F such that u,v G X; 

and 

3. for every X^Y, Z G V, if Y is on the path from X to Z in T, then XC\Z C Y. 

The width of the tree-decomposition T is co{T) = ma.x^^y{\X\ — 1}. The 
treewidth of a graph G is min7’{o;(T)}, where the minimum is taken over all 
the tree-decompositions T of G. Note that trees have treewidth 1. We use the 
following property whose proof can be founded in pp. 256]. 
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Property 1. Let {Xi,X 2 } be any edge of a tree-decomposition T of G, and let 
Ti,T 2 be the components of T \ {Xi, X 2 }, with Xi G Ti and X 2 G T 2 . Then 
I = XiC\ X 2 separates U\= (UagTi \ from U 2 = (UbgT 2 \ ^ in G. 



Theorem 5. For every graph G = (V, E) of diameter D and treewidth t, and 
for every k ^ [Z1/2J, we have 'yk(G) ^ t-l- 1. 

Proof. (Outline) Let T = {V , E) be a width t tree-decomposition of G, i.e., 
such that |X| ^ t -I- 1 for every X G V. We prove the theorem by showing 
that there exists some set X* G V that fc-dominates G. For X G V, let T{X) 
denote the subtree of T rooted at X, and let Succ(X) = UygT(x) 

Far(X) = P\ A(X). 

We construct a cycle-free path Xi, X 2 , . . . , X^ in T step by step, such that 
Xi is the root of T, Xm = X* , and Xi does not ^-dominate G for i < m. The 
initial step consists of selecting X\ and checking whether it fc-dominates G or 
not. Starting with a current node Xi of T, Step i consists of choosing the unique 
child Xi^i of Xi such that Succ(Xi_|_i) contains Far(Xi_|_i). It suffices to prove 
by induction on i that Step i is doable, namely, that there exists such a unique 
child Xi+i. Indeed, since the length of a cycle-free path in T is finite, we stop 
the process, in the worst-case, once reaching a leaf of T. The full proof can be 
founded in [S]. □ 

It is known that graphs of genus g and diameter D have a treewidth bounded 
by 0{gD), cf. |4]. For planar graph one can tighten this bound. For that we need 
the following definition. A plane graph is a 1-outerplanar embedding if all its 
nodes lie on the outer face. For r ^ 2, a plane graph is an r-outerplanar embed- 
ding if deleting all the nodes of the outer face yields an (r— l)-outerplanar embed- 
ding of the remaining graph. A graph is r-outerplanar if it has a r-outerplanar 
embedding. We refer to 1-outerplanar graphs (such as trees and AT 2 , 3 , A' 4 -free 
graphs) simply as outerplanar graphs. 

Theorem 6. For every planar G of diameter D, and for every k \D/2\, we 
have 'Yk{G) Gt 3D -f 3. 

Proof. Assume that G is r-outerplanar for r ^ 1. We first assume that r ^ 2. 
Consider an r-outerplanar embedding of G. Let Vi denote the outer layer of G, 
namely, the set of all the nodes lying on the outer face of G. Then, successively 
for i = 2 , . . . , r, let Vi denote the set of all the nodes lying on the outer face of 
G \ (Vi U • • • U Vi-i). For every i, consider some Ui G Vi. Observe that for every 
j > i, we have dc{ui,Uj) ^ j — i. Indeed, if dG{ui,Uj) < j — i, then there is 
a path of at most j — i — 1 edges linking a node of Vi to a node of V), which 
is impossible for a plane embedding of G (there is no edge between a node of 
Vi and a node of Ci+ 2 ). Thus dciui^Ur) ^ r — 1, implying that D ^ r — 1. 
Obviously, the latter result holds for r = 1. 

Now, in PP, it is shown that r-outerplanar graphs have treewidth at most 
3r — 1. The claim follows by Theorem O □ 
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A graph is c-chordal if every induced cycle is of length at most c. Triangulated 
graphs are precisely 3-chordal graphs. Using a result of that shows that c- 
chordal graphs have a tree-decomposition T = (U, E) such that for every X € V 
and all u,v £ X, dc{u, v) ^ c/2, one can show (full proof is given in M)- 

Theorem 7. For every c-chordal graph G of diameter D, and for every k ^ 
[H/2J -h [c/2j, we have "fk{G) = 1. 

4 Application to Compact Routing 

Let us now switch to discussing an application of our results in the area of 
communication networks. A point-to-point communication network is modeled 
as a graph G = (V,E), where the set of nodes represent the processors of the 
network and every pair of two opposite arcs represents a bidirectional commu- 
nication link. A routing scheme i? is a distributed algorithm whose role is to 
deliver messages between nodes of the network. The routing scheme consists 
of certain distributed data structures in the network, and a delivery protocol, 
which can be invoked in any node u with two parameters: a routing label C(y) 
of the destination node v, and the message’s information field. The message 
is delivered to v via a sequence of transmissions determined uniquely by the 
distributed data structure, i.e., at each intermediate node along the route, the 
routing scheme decides the next neighbor to which the message should be for- 
warded. The length of the route traversed by a message from m to u in the graph 
G according to the routing scheme R is denoted by pn{u,v). The dilation of a 
routing scheme R is the maximal route length of a path traversed by a message, 
formally max^^t, {pn{u,v)}. 

An interval routing scheme i? on G is a routing scheme consisting of a pair 
(£,I), generated in the preprocessing step, where £ is a node-labeling, £ : U — >■ 
{!,... ,n}, and I is an arc-labeling, I \ E ^ that satisfy the following 

condition. For any node u, the collection of sets that label all the outgoing arcs 
of u forms a partition of the name range (possibly excluding u itself^) . Formally, 
for every u € V, denoting by £„ the set of arcs incident to u, 

1- UeGis„ ^(e) = {1, • • • , ^} \ {^(w)}. 

2. I(ei) m(e2) = 0 for every two distinct arcs ei, 62 G E^- 

The delivery protocol is defined as follows. Given a destination node v, set 
the first header to be = £(u). Also, for every node u, receiving a message 
with header h, first check if £(u) = h, and deliver the message locally if equality 
holds (meaning that u is the message’s final destination). If not, then forward the 
message with the same header h on the output port corresponding to the unique 
arc (u,w) such that h G I(u,v), namely, the arc whose labeling set contains the 
destination label. 

^ A labeling excluding u from its arc-labels is termed strict. Although non-strict la- 
beling may produce more compact schemes, in this paper we restrict our attention 
to strict labeling only. 
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Note that any interval routing scheme can be implemented by classical rout- 
ing tables. The main difference is in the coding of the table. 

Given an integer n and a subset I C n}, define the compactness of I 

w.r.t. n, denoted Cn{I), as the smallest integer k such that I can be represented 
by the union of k intervals [a,b\ of consecutive integers from n}, with 

n and 1 being considered as consecutive (cyclically). The compactness of an 
interval routing scheme R = (C,I) on G is the maximum, over all arcs e € E, 
of the compactness c„(I(e)) of the set 1(e) labeling e. The total compactness is 
the sum c™(T(e)). 

Because interval routing schemes can be implemented by routing tables, we 
deal with the compactness of a routing table. Intuitively, smaller compactness 
and degrees imply smaller routing tables. The interval routing strategy presented 
in m, based on routing on a minimum spanning tree, has compactness 1 (for 
every graph of diameter D) and dilation at most 2D. It is known that there are 
worst-case graphs for which every routing table of compactness 1 has dilation 
at least 2D — 3 [IS]. It is also known that every graph has routing table of 
compactness Vn Inn-I-O(l) with dilation at most [1.5Z?] [in], whereas there are 
worst-case graphs for which every routing table of compactness k has dilation 
at least [1.5Z3J — 1 for every k = f^( j>iog”„/j>) ) 0- Other results can be found 
on the recent survey |6]. 

Here we show a trade-off between dilation and compactness. 

Theorem 8. Let G he a graph of diameter D with n nodes and m edges. Let 
k ^ 0, and let t = 'yk(G). Then, G has a routing table with dilation at most 
D + k, compactness at most t/2 -|- 1, and total compactness at most tn + 2m. 
Moreover, ift/logn — >■ -l-oo, then the compactness can he reduced to t/4 + o{t). 

Proof. Let {x\, . . . ,Xt\ be a fc-dominating set in G of minimal cardinality t = 
7 fc (G) . Construct a partition of the graph nodes into t pairwise disjoint connected 
regions Ri C Rkixi) around each xi, such that each region Ri consists of nodes 
closest to Xi (breaking tie arbitrarily) and the union of the regions covers all the 
nodes. Obviously the radius of each region is at most k. 

For every Xi, construct a shortest path spanning tree R for G. Let Tj be the 
restriction of Ti to the region Ri, i.e., Ti = Ti (1 Ri. 

Partition the range of integers [1, n] into t contiguous intervals Li of size 
for i £ {1, . . . ,t}. For each region Ri, assign each node v G Ri a, distinct label 
C{v) from the range li, in DFS order, starting at Xi. Use these labels to define 
an interval routing scheme on Ti as in HD. The compactness is 2 (instead of 1) 
because a cyclic interval in U has to be represented by two sub-intervals of U. 
For every two nodes u,w G Ri, this scheme yields a shortest route on Ti (albeit 
perhaps not shortest in G). 

In addition, for every 1 ^ i ^ t and for every node w ^ Ri, add the interval 
li to the arc connecting w to its parent in Ti. 

As the regions are disjoint, it is easy to verify that for every node v, all the 
intervals assigned to the arcs of v are pairwise disjoint. Overall, at most t/2 + 1 
intervals were assigned to each arc because every subset of at most t intervals has 
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no more than t/2 non contiguous intervals (one more sub-interval is required for 
the own region of v). In fact, exactly t + deg(z;) intervals altogether (deg(?;) -I- 1 
intervals for Ri and t — 1 for all the others) are assigned to the arcs of a node v 
of degree deg(i;). Thus the total compactness is tn + 2m. 

Observe that the resulting route between nodes belonging to the same region 
has length at most 2k ^ D + k. As for routing from a node u G Ri to a node 
w G Rj, j yf i, note that the first segment of the route, until reaching Rj, 
proceeds along a shortest path from u to Xj , and once entering Rj , the remaining 
segment of the route follows a shortest path. Hence the total length of the route 
cannot exceed dciu, Xj) + dc{xj,w) ^ D + k. 

Actually, the bound on the compactness can be slightly reduced to t/4 + o{t), 
for t large enough, using the result of [H]. This result shows that, for every t, 
there exists a permutation tt of {1, . . . , t} such that the compactness of every set 
T’ of a family T of subsets of {1, ... , t}, with \T\ < exp(t/2)/t, satisfies: 

Ct(7r(F)) < ^ • n), 

where tt{F) denotes the set {7r(a:) | x G f}. Let 1(e) be the set of labels assigned 
to arc e by the previous interval routing scheme. Set T = {F{e) \ e G E{G)}, 
where F{e) = {i \ li G 1(e)} corresponds to the set of region’s indices contained 
in 1(e). Note that the range of labels of each region (except for u’s) is wholly 
contained in 1(e). Since Uj=i we have that c„(I(e)) ^ Ct(F(e)) -I- 1. 

Observe that the relative order of the ranges can be chosen arbitrary: all the 
labels of some Ij can be chosen to be larger or smaller than all the labels of any 
other range. So, indices of Ij's ranges can be permuted by some permutation tt 
of [l,f] in order to decrease Ct{n{F{e))). By jS], if \F\ < exp(o(t))/t, then there 
exists ttq such that Ct(7To(F(e))) < t/4 -|- o(<), for every F{e) G T . Note that 
\F\ — 2m. So, the condition \F\ < exp(o(t))/t can be replaced by t/logn — >■ -l-oo, 
as TO = O(n^). □ 

This result can be viewed as a generalization of |10| . originally proved for 
k = [D/2]. In that paper it was shown that every graph G satisfies 7 fc(G) ^ 
[ \/nlnn ] , and the compactness derived therein was [ \/nlnn ] -I- I . Thus Theo- 
rem [8] improves the compactness bound obtained in | I0| by an asymptotic factor 
of 4. The same kind of construction has been also implicitly used in jlB]. Specif- 
ically, it is shown in m that every graph G satisfies 'jk{G) ^ y/n for k ^ 2D/3. 
Thus, every graph has routing table of compactness at most + o[^Jn) for 

dilation [5D/3]. (The original compactness result of |IE] was [\/nl + !■) 
Combining Theorem [31 and Theorem El we obtain the following. 

Corollary 3. Let e > 0 be an arbitrary constant. Every planar graph has a 
routing table with dilation at most [(12/7 -|- e)D~\, and constant compactness. 

Note that Conjecture [T]for cq = 4 implies that every planar graph of diameter 
D enjoys a routing table with dilation at most [1.5D] and compactness at most 3. 
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Abstract. The Steiner tree problem asks for a shortest subgraph connecting a 
given set of terminals in a graph. It is known to be APX-complete, which means 
that no polynomial time approximation scheme can exist for this problem, unless 
P=NP. Currently, the best approximation algorithm for the Steiner tree problem has 
a performance ratio of 1 .55, whereas the corresponding lower bound is smaller than 
1.01. In this paper, we provide for several Steiner tree approximation algorithms 
lower hounds on their performance ratio that are much larger. For two algorithms 
that solve the Steiner tree problem on quasi-bipartite instances, we even prove 
lower bounds that match the upper bounds. Quasi-bipartite instances are of special 
interest, as currently all known lower bound reductions for the Steiner tree problem 
in graphs produce such instances. 



1 Introduction 

Given a graph G = {V,E),& length function on its edges, and a set i? C C of terminals, 
a Steiner tree is a connected subgraph of G spanning all vertices in R. The Steiner tree 
problem in graphs is to find a shortest Steiner tree. This problem is a classical NP-hard 
problem flOll and even worse it is also known to be APX-complete, i.e., there exists some 
constant c > 1 such that no polynomial time approximation algorithm for this problem 
can have a performance ratio smaller than c, unless P=NP. (The performance ratio of an 
approximation algorithm is the largest ratio between the length of a solution found by 
the algorithm and the optimal length.) 

The Steiner tree problem appears in many different applications, as for example in 
VLSTdesign, in the design of telecommunication networks, and in the reconstruction of 
phylogenetic trees in biology. Therefore it is an important task to find good approxima- 
tion algorithms for this problem and, eventually, to prove that no better approximation 
algorithms can be possible, unless P=NP. 

Currently, the best approximation algorithm for the Steiner tree problem is due to 
Robins and Zelikovsky ||7] and has a performance ratio of 1 -F | In 3 < 1.550. On the 
other hand, the largest known lower bound for the approximability of the Steiner tree 
problem has a value not exceeding 1.01 1 41121 . 

This large gap between lower and upper bounds suggests that there might be still some 
room for improvements on both sides. There exist several approximation algorithms for 
the Steiner tree problem in graphs that achieve a performance ratio less than 2 il. But 
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Table 1. Summary of results 





approximation algorithm 


known up- 
per bound 


our lower 
bound 


general graphs 


Relative Greedy Algorithm 


1.694 


1.333 




Loss Contracting Algorithm 


1.550 


1.200 


quasi-bipartite graphs 


Iterated 1 -Steiner Heuristic 


1.500 


1.500 




Loss Contracting Algorithm 


1.279 


1.195 




Greedy-MSS 


1.216 


1.216 



for none of these algorithms it is known whether their analysis is tight. Therefore it is 
not clear which of these algorithms actually achieves the best performance ratio. 

An instance of the Steiner tree problem is quasi-bipartite if the set y \ i? is inde- 
pendent. Quasi-bipartite graphs are an important special case as the instances resulting 
from lower bound reductions Em have this form. For these graphs there exist several 
approximation algorithms for which better performance ratios have been proved than in 
the general case. 

In this paper, we provide for several Steiner tree approximation algorithms lower 
bounds on their performance ratio. For two algorithms that solve the Steiner tree problem 
on quasi-bipartite instances, we even prove lower bounds that match the upper bounds. 
Such lower bound results, even if they are not tight, allow to estimate the quality of a 
given performance analysis and can provide ideas on how to improve it. Moreover, these 
lower bounds give good examples of instances on which every Steiner tree approximation 
algorithm must perform well, and thus might yield to better approximation algorithms 
for the Steiner tree problem. 

All algorithms considered in this paper are based on a general greedy framework 
which we describe in Section El In Sections Eland 0] we present lower bounds for two 
approximation algorithms for the Steiner tree problem in general graphs. Lower bounds 
for approximation algorithms for the quasi-bipartite case are given in Sections 01 El 
andO For two of these algorithms we even provide tight lower bounds. Our results are 
summarized in TablelH 



2 Notations and a General Framework for Greedy Algorithms 



Given a graph G = {V,E),& length function on its edges, and a set i? C 1/ of terminals, 
a Steiner tree is a connected subgraph of G spanning all vertices in R. We denote a 
Steiner minimum tree by SMT and its length by smt. A Steiner tree usually contains 
not only vertices from R but also from V\R. These vertices are called Steiner vertices. 
Every Steiner tree can be split into so called full components. A full component is a 
Steiner tree for a subset of R in which every terminal is a leaf. The length of a full 
component is the sum of its edge lengths. Therefore, a Steiner tree is a collection of full 
components which is connected and covers R. 
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{ FC is a set of full components } 

* ^ 0 

While an improving full component exists : 
Find Ti+i G FC that minimizes fi 
i ^ i + 1 
imax ^ i 

Output Steiner tree using Ti , . . . , 

Fig. 1. A general framework for greedy algorithms. 



Most of the known approximation algorithms for the Steiner tree problem in graphs 
ht into the general framework shown in Figure[T] Let FC be a set of full components. The 
algorithm chooses the (i + l)-st full component S FC such that a certain function 
fi : FC —>■ K+ is minimized. The algorithm stops when no further improvement is 
possible. The precise meaning of this condition depends on the function fi. Finally 
a Steiner tree is output that uses the full components that have been selected in the 
while-loop. 



3 Relative Greedy Algorithm 



In this section we consider Zelikovsky’s relative greedy algorithm im, which achieves 
a performance ratio of less than 1.694. This algorithms fits into the general greedy 
framework for Steiner tree approximation algorithms as follows. Let G = (V, F) be 
a graph with a weight function on the edges and R C V he the set of terminals. Fix 
some constant k € N and define FC to be the set of all full components with at most 
k terminals. The weight of a full component X G FC is simply its length and will be 
denoted by | X |. Note that FC can be computed in polynomial time. 

From G one can easily derive the so called terminal distance graph G' which is a 
complete graph on the set R where the weight of an edge is the length of a shortest path 
in G connecting the two endpoints of this edge. If Ti , . . . are some full components 
in FC then MST{R/Ti . . . Ti) denotes a minimum spanning tree in the graph that is 
obtained from G' by contracting each of the full components Ti, ... ,Ti. Its length is 
denoted by mst{R/Ti . . . Ti). 

Now the function fi on a full component X G FC is defined as follows; 



MX) 



m 

mst{R/Ti . . . Ti) — mst{R/Ti . . . TiX) 



The while-loop of the algorithm will be executed, as long as there exists some full 
component X G FC with fi{X) < 1. Zelikovsky proved that on termination of the 
relative greedy algorithm the following inequality holds: 



|Ti| -I- . . . -I- < (1 -I- In 2 -f £)• smt . 



If e is chosen small enough this value is smaller than 1.694 . 
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Theorem 1. The performance ratio of the relative greedy algorithm is at least i/ 3. 

Proof. We will construct a family of instances Fi for the Steiner tree problem and prove 
that the ratio between the length of a Steiner minimum tree and the length of a Steiner 
tree that is found by the relative greedy algorithm tends to 4/3 as i oo. 

The instance Fi has (2* — 1) (2* — 1) + 1 terminals which are arranged in a (2* — 1) x 
(2* — l)-grid with one extra terminal s. We assume that s belongs to every row and every 
column of the (2* — 1) x (2* — l)-grid, i.e., we can think of s as the last row and last 
column of a 2® X 2®-grid. For each row, we have a binary tree, called the row tree, which 
has depth i and the 2® terminals of a row as its leaves. For each column we have a star, 
called the column star, which has the 2® terminals of a column as its endpoints. No two 
row trees or column stars have a vertex in common, except for terminals. Thus, in total 
the instance Fi has (2® — 1)(2® — 1) + (2® — 1) = 2®(2® — 1) non-terminal vertices. 
Finally, we connect any two consecutive terminals in a row by an edge. In Figure|2]one 
row tree and one column star and the edges between any two consecutive terminals in 
one row of the instance F3 are shown. 

We now have to specify the weights for the edges of the instance Fi. For all row 
trees all edges within one level have the same weight. The edges of the first and second 
level of the binary tree, i.e., the six edges incident to the root of the binary tree and its 
two children, have weight 1 /4. The 2® edges of the i-th level of the binary tree, i.e., the 
edges incident to the terminals have weight 1. All other edges in the binary tree have 
weight 1/2. The total weight of all row trees is therefore 

(2® - 1)(2® . 1 + (2® - 8) • ^ + 6 • i) = (2® - 1)(3 • 2®"i - . 

Next we specify the weights for the edges of the column stars. All edges within 
one column star have the same weight, which depends on the column. The columns are 
numbered from left to right by 1, 2, . . . , 2® — 1. We number a column star with a value 
that is obtained from the binary representation of the column number read from right to 
left. E.g., for i = 3 the binary representation of the coulmn numbers is 001, 010, 011, 

100, 101, 110, 111 and these numbers read from right to left give 100, 010, 110, 001, 

101, oil. 111. Thus the column stars are numbered 4, 2, 6 , 1, 5, 3, 7 from left to right. 
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Table 2. Small lower bound instances for the relative greedy algorithm. 



i 


lower bound 


edge weights (levelwise) 


lower bound 


edge weights (levelwise) 


3 


II = 1.2368 


1 i ^ 

-L) 4? 4 


1.2895 


1 i i 

-L? 4^ 4 


4 


ffl = 1.3081 


1 1 1 1 
-L? 2’ 4’ 4 


1.3151 


1 1 1 1 
-L? 2 ’ 4 ’ 4 


5 


fi = 1-3255 


1 1 1 1 1 


1.3290 


1 1 3 1 1 

2 ’ 8 ’ 4 ’ 4 


6 


ii A 1.3305 


1 1 1 1 1 1 

2 ’ 2 ’ 2 ’ 4 ’ 4 







All edges in the column stars with numbers 1 to 2*“^ + 2 have the same weight. We 
dehne it as (2 — 2^“*)(1 — 3 • 2“®“^). The edges in all other column stars get the weight 
2 — 2^“*. As each column star has 2* edges, the total weight of all column stars is 

2L(2-2i-*)((2*-i + 2)(l-3-2-*-i) + (2*-i-3)) = 2- (2* - 1) • (2* - ^ . 

Finally we have to specify the edges connecting two consecutive terminals in a row. 
They are all defined as (2 — 3 • 2“*) (2 — 2^“*). 

Note that the set of all row trees is a solution to the Steiner tree problem (in fact, 
it is easily seen that it is the optimum solution for the instances Fi). We claim that the 
relative greedy algorithm started with A: = 2® will return the set of all column stars as 
solution. Therefore the performance ratio of the relative greedy algorithm is at least 

2.(2®-1).(2®-|- A) 

(2® - 1)(3 -2®-i - |) . 

This expression tends to 4/3 as i goes to infinity, which proves the statement of the 
theorem. 

To prove that the relative greedy algorithm indeed returns the set of column stars as 
solution we assume that it selects as the /-th full components the column star numbered 
with j, i.e., we assume that the ties between the weights of the column stars are broken 
exactly in this way. By adding a weight of / • e to the j-th column star one could 
break all the ties, but this would make the calculations more complicated. It is now a 
straightforward but quite tedious calculation to show that the cost function of the relative 
greedy algorithm is minimized in the j -th step by the column star numberd with j . Details 
of this calculation are omitted in this version of the paper due to space restrictions. □ 

In column two of Table El the lower bounds that are obtained by the construction 
described in the proof of Theorem [Hare given. The third column contains the weights 
of the edges of the row trees. The construction given in the proof is not optimal as is 
indicated by columns four and five of this table. The lower bounds can be improved 
slightly by choosing better weights for the edges of the column stars and row trees. For 
i = 3, 4, 5 we computed such optimal weights and obtained the lower bounds given in 
column four of Table E] We were not able to calculate a general formula for the lower 
bound of these instances, but it seems to be the case that the lower bounds also converge 
to 4/3. 
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Fig.3. A 1.2 lower bound instance for the loss contracting algorithm. All edges of the row trees 
have weight 5. The edges incident to s and t have weights 9 and 6, respectively. 



4 Loss Contracting Algorithm 



Currently the best approximation algorithm for the Steiner tree problem in graphs the 
loss contracting algorithm of Robins and Zelikovsky 0 which has a performance ratio 
of at most 1 + ^ < 1.550. The loss contracting algorithm is very similar to the relative 
greedy algorithm, but instead of contracting a selected full component T entirely, it 
contracts only a subset Loss{T) of its edges in order to achieve the same effect at a 
lower price. The loss of a full component T is a minimum length forest contained in T 
such that every Steiner vertex of T is connected to a terminal. Its length is denoted by 
loss{T). The contracted parts are connected by a minimum spanning tree using edges 
from the terminal distance graph. We denote its length by m(-) := mst{R/Loss{-)). 
Clearly, the length of the Steiner tree corresponding to the full components Ti, . . . ,Ti 
is lossiTi, ...,Ti)+ m(Ti, . . . ,Ti). 

The loss contracting algorithm fits into the framework of Figured! In each step, the 
loss contracting algorithm chooses a full component that minimizes the function 



MT) 



loss{T) 

m{Ti . . .Ti) - m{Ti . . . RT) 



It stops when there are no more improving full components, i.e., minreFC fi{T) > 1. 



Theorem 2. The performance ratio of the loss contracting algorithm is at least 1.2. 

Proof We use a construction similar to the one used in the proof of Theoremd! Unfor- 
tunately, this construction breaks down for grids of size larger than 4x4. We explain 
the best weight assignment for a 4 x 4-grid as shown in Figured! The edges of the row 
trees have weight 5. The edges incident to s and t have weights 9 and 6, respectively. We 
denote the full components containing s and t by Tg and T(. The optimal Steiner tree 
uses the row trees and has length 75. 

Notice that the edges in MST{R) connecting the vertices of the upper row of ter- 
minals have lengths 10, 15, and 10 from left to right. For each row tree, the loss is 
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Fig. 4. A quasi-bipartite 1.195 lower bound instance for the loss contracting algorithm. 



10 and fo is 10/ (3 • 35 — (2 • 35 + 15)) = 0.5. One can easily check that sub- 
trees of the row trees are worse. Since fo(Ts) = 9/(3 • (35 — (20 + 9))) = 0.5 
and fo{Tt) = 6/(3 • (35 — (25 -I- 6)) = 0.5, the algorithm is allowed to choose Tg. 
(Choosing Tg could be forced by perturbing the edge weights.) In the second step, the 
value of fi for each (complete) row tree is 10/ (3 • 29 — (2 • 29 -f 15)) = 10/14 > 0.7, 
and fi{Tt) = 6/(3 • (29 — (10 -f 9 -f 6))) =0.5. Therefore Tt is chosen. Now the value 
of /2 for each row tree is 10/ (3 • 25 — (2 • 25 + 15)) = 1, and the algorithm stops with 
a solution of length loss(Ts,Tt) + m{Tg, T() = (9 + 6) + 3 • (10 + 9 + 6) =90. We 
get a lower bound of 90/75 = 1.2. □ 

Robins and Zelikovsky n also showed that the performance ratio of the loss con- 
tracting algorithm is at most 1.279 if the graph is quasi-bipartite. A quasi-bipartite lower 
bound instance for the loss contracting algorithm is shown in Figure[4l It is only slightly 
weaker than the one shown in Figure |3]and comes fairly close to the performance ratio 
proved in Q. 

Theorem 3. The performance ratio of the loss contracting algorithm in quasi-bipartite 
graphs is at least (5 — \/2)/3 > 1.195. 

Proof. The quasi-bipartite lower bound instance is shown in FigureH) It is based on the 
observation that one full component chosen by the algorithm can ‘destroy the benefit’ of 
two full components of the optimal solution. The Steiner minimum tree consists of the 
full components Tyj and and has length 6. MST{R) uses four diagonal edges and 
has length 8. 

At the beginning, we have fo{Tu) = (2 — -\/2) /(8 — (2 • 2 + 2 • V^)) = 0.5 and 
/o(T„) = (2 - V2)/(8 - (2 • 2 + 2 • (3 - ^2))) = l/\/2 > 0.5, whereas /o(T^) = 
fo{Tx) = l/(8— (2-2+2- 1)) = 0.5. Therefore we may assume that the loss contracting 
algorithm decides for T„. MST{R/Loss{Tu)) has the same form as MST{R), but the 
two edges to the right have shrunk from 2 to v/2- Therefore m(T„) = 4 + 2\/2. 

In the next step, we have /i(T„) = (2 — \/2) /((4 + 2\/2) — (2 • (3 — \/2) + 2 • 
V2)) = 1/^ and /i(T^) = /i(T,) = l/((4 + 2^2) - (1 + 1 + 2 + ^2)) = 1/V2. 
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We may assume that the loss contracting algorithm chooses T„. The edges on the left 
hand side of MST(R/Loss(Tu,Ty)) have shrunk from 2 to 3 — \/2. Now we have 
to(T„, T„) = 2 • (3 - \/2) + 2 • \/2 = 6. 

As / 2 (T„) = / 2 (T,) = 1/(6 - (1 + 1 + (3 - v/2) + V2)) = 1, no further 
improvement is possible and the algorithm will stop at this point without using and . 

The length of the Steiner tree found by the loss contracting algorithm is \Tu \ + \Ty\ = 
10 — 2v^- The lower bound follows. □ 



5 Iterated 1- Steiner Heuristic 

The iterated 1-Steiner heuristic is a simple greedy local search heuristic. Recall that the 
Steiner minimum tree for a set of required vertices R can be reconstructed from its set 
of Steiner vertices / as SMT{R) = MST{R U I). Starting from a spanning tree for 
the terminal set, i.e. / = 0, the iterated 1 -Steiner heuristic adds in each step a single 
Steiner vertex to I if this will reduce the size of the MST. The algorithm stops if no such 
improvement is possible. A vertex s is accepted if 

fi{s) := mst{RU{si, . . . ,Si,s}) - mst(i?U {si, . . . , s J) < 0. 

Since iterated 1 -Steiner starts from a spanning tree in the terminal distance graph and 
mst{R) < 2 • smt{R), it is clear that its solution is never more than a factor of 2 away 
from the optimum. No better performance ratio has been proven. Minoux m showed 
how to accelerate iterated 1 -Steiner in quasi-bipartite graphs, but did not improve the 
performance ratio. 

Rajagopalan and Vazirani gave a 3/2 -|- £ approximation algorithm for the Steiner 
tree problem in quasi-bipartite graphs which is based on the primal-dual method. They 
pointed out that approximation algorithms for the Steiner tree problem in quasi-bipartite 
graphs are a significant step towards better approximation algorithms for general in- 
stances. By now, the best approximation ratio known for the quasi-bipartite case is 1.279 
and belongs to the loss contracting algorithm of Robins and Zelikovsky Q. In the same 
paper, they also showed that already the simple iterated 1 -Steiner heuristic achieves an 
approximation ratio of 3 /2 in quasi-bipartite graphs. Here we give a family of instances 
for which this ratio is asymptotically attained. This shows that the analysis of iterated 
1 -Steiner in quasi-bipartite graphs in mi is tight. Interestingly, their result does not rely 
on a greedy (locally optimal) choice of the next Steiner vertex which is included into 
the set of Steiner vertices I. It is not necessary to find a vertex s that minimizes /i(s); 
non-positivity suffices. 

A slight improvement of the iterated 1 -Steiner heuristic is to remove Steiner vertices 
which have degree one or two in MST (i? U /). (These may result from changes in the 
topology during the course of the algorithm.) Steiner vertices of degree one can only add 
to the length of the minimum spanning tree without connecting a terminal, and Steiner 
vertices of degree two can be removed safely because a shortcut is present in the terminal 
distance graph due to the triangle inequality. 

Theorem 4. The performance ratio of the iterated 1 -Steiner heuristic in quasi-bipartite 
graphs is 3/2 (even if Steiner vertices of degree 1 or 2 are removed). 



Lower Bounds for Approximation Algorithms for the Steiner Tree Problem 



225 




k Steiner vertices 



Fig. 5. A simple lower bound instance for iterated 1 -Steiner. 




k Steiner vertices 



Fig. 6. A lower bound instance for iterated 1-Steiner with vertex removal. 



Proof. A bad instance for the iterated 1 -Steiner heuristic without vertex removal is shown 
in Figure^ All edges have unit weight. Assume that the algorithm has already chosen the 
k Steiner vertices of degree 3 in the row (each of them reduced the value of mst{R U I) 
by one when it was added to I). Then the iterated 1-Steiner heuristic will stop at this point 
without including s, because this would increase the length of the minimum spanning 
tree from 3k to 2k + 1 + k. The Steiner minimum tree, however, consists of the star 
around s. We get a lower bound of 2 kXi ’ which converges to | as A: — c». 

The instance from Figure 0 does not work for the iterated 1-Steiner heuristic with 
vertex removal, since it will remove all the Steiner vertices from I and keep only s for the 
final solution. Consider Figure Elinstead. Again we assume that the set I consists of the 
Steiner vertices of degree 3. We have split the vertex s from Figure Elinto four vertices 
a, b, c, and d, each of degree ^ | . Another vertex e of degree four is connected to one 
terminal from each of these stars around a, b, c, and d. The connection points are spread 
out in such a way that for each u G {a, b, c, d, e}, no vertex in I is adjacent to more than 
one terminal from the star around u. This implies that no vertex of / can have degree 
one in MST{R U / U {it}) and consequently, mst{R U / U {it}) = mst{R U /) -f 1. 
Therefore iterated 1-Steiner with vertex removal will stop at this point. The resulting 
lower bound is 



226 



Clemens Gropl et al. 



6 Greedy-MSS 



Recently, a new approximation algorithm for the Steiner tree problem in quasi-bipartite 
graphs was presented in |21 . The analysis of this algorithm uses a matroid-style exchange 
argument, and uses ideas from the analysis of the greedy algorithm for set cover. (In fact, 
it can even be extended to the minimum spanning set problem in polymatroids ||2].) 

The resulting algorithm Greedy-MSS d achieves a performance ratio of 1.217 
for the Steiner tree problem in quasi-bipartite graphs where all edges incident to a 
Steiner vertex have the same weight. Such instances are called uniformly quasi-bipartite. 
Previously the best perfomance ratio for this class of instances was 1.279 and due to the 
loss contracting algorithm of [7]|. (See also Section @1) 

In quasi-bipartite instances of the Steiner tree problem in graphs, every full com- 
ponent contains a unique Steiner vertex. Algorithm Greedy-MSS tries to minimize the 
length-per-connection ratio greedily, i.e., we have 



h{T) := 



1^1 

c^{T) 



in the general greedy framework (Fig.|T]). Here \ T\ denotes the length of the full compo- 
nent T and Ci{T) is defined as the difference between the number of components before 
and after T is added to the subhypergraph {R, {Ti, . . . , Ti}). The algorithm stops when 
{R, {Ti, . . . , Tp}) is a spanning subhypergraph. While the number of full components 
around a particular Steiner vertex which might be added in step i can in general be 
exponential, it is easy to see that Greedy-MSS will always choose one that yields as 
many new connections as possible among them, and due to the uniformity condition all 
of these full components have the same weight. Therefore, the algorithm only has to 
evaluate one full component for each Steiner vertex in each phase. Greedy-MSS can be 
implemented to run in 0{^V^R) time. 

Theorem 5. The performance ratio of algorithm Greedy-MSS is 73/60 = 1.216. 

Proof. The upper bound on the performance ratio was proved in 0. Here we give an 
instance for which the performance ratio 73/60 of Greedy-MSS is attained. 

The worst case instance is shown in Figurel?] All edges have length 1. The terminals 
{ Tij I 1 < i < 4, 1 < j < 12 } are arranged in form of a grid. (For notational 
convenience we will number the rows from bottom to top.) We also have a terminal tq 
which is shown in the top left corner. The set of potential Steiner vertices is AT U F. 
The set X consists of the vertices {xi, . . . , X 12 ], where Xj is connected to ri ^ , . . . , r 4 ^ 
and tq. a Steiner tree using the Steiner vertex set X has total length 5 • 12 = 60, which 
is in fact optimal. The set Y consists of vertices yij which are connected to tq and a 
subset of terminals in a row, as indicated by the dashed rectangles. Formally, we have 
^ { Vij I 1 < * < 4, 1 < j < 12/* }, and yij is connected to . . . , 

and rg. Next we show that Greedy-MSS may end up with a Steiner tree using the Steiner 
vertex set Y. Since the total length of this Steiner tree is3-5-|-4-4-|-6-3-|-12-2 = 73, 
the theorem follows. 

Note that in the unweighted quasi-bipartite case, fi{T) = |T|/(|T| — 1) for every full 
component T that does not create a cycle. Therefore Greedy-MSS will always choose a 
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Fig. 7. A worst-case instance for Greedy-MSS. 



full component that connects as many connected components as possible. Assume that 
the Steiner vertices t/4 i, . . . , t/4 3 have already been chosen. Then the full components 
around the Steiner vertices in X have been ‘nibbled off’ such that the size of any largest 
full component that does not create a cycle has dropped down to 4 . Therefore Greedy- 
MSS might continue choosing the Steiner vertices j/3 1, . . . , 7/3 4. Going forth in this 
way, we arrive at Y and stop since the graph is connected. □ 

A smaller worst case instance for Greedy-MSS was given inQ. It was inspired by 
our proof of the performance ratio of Greedy-MSS, but requires edge weights. 
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Abstract. NLCj, for fc = 1, . . . is a family of algebras on vertex-labeled 
graphs introduced by Wanke. An NLC- decomposition of a graph is a 
derivation of this graph from single vertices using the operations in 
question. The width of such a decomposition is the number of labels 
used, and the NLC-width of a graph is the minimum width among its 
NLC-decompositions. Many difficult graph problems can be solved ef- 
ficiently with dynamic programming if an NLC-decomposition of low 
width is given for the input graph. This paper shows that an NLC- 
decomposition of width at most log n times the optimal width k can be 
found in time. Related concept: clique-width. 



1 Introduction 

A frequently proposed approach for dealing with graph problems is to decompose 
the input graph in such a way that dynamic programming can then be applied. 
In [15] Wanke introduced the algebra of /c-NLC graphs for this purpose. Graph 
vertices are labeled with numbers in [fc] = {!,..., fc}, and the operations are 
relabeling and union. The latter permits edges to be drawn between the two 
involved graphs. Importantly, this is done on the basis of vertex labels; once two 
vertices have acquired the same label, they can no longer be differentiated for 
edge-drawing. The class of A:-NLC graphs, also called NLCfe, now consists of all 
graphs that can be derived from single vertices with these operations. Wanke 
went on to show that the maximum cut size of a graph can be computed in 
polynomial time if the graph is given as an NLCfc-decomposition (a derivation 
in NLCfc), and that under the same conditions, it can be determined whether a 
graph has a Hamiltonian circuit. More examples are given in m- 

A similar algebra was presented in |B] (alternatively, see m)- For this it has 
been shown that all problems expressible in certain versions of monadic second- 
order logic can be solved in linear time for graphs given as decompositions with at 
most k labels |bl7j . (Maximum Clique and c-Colorability are two such problems.) 
The number of labels needed to produce a graph in this algebra is called its 
clique-width. 



A. Brandstadt and V.B. Le (Eds.): WG 2001, LNCS 2204, pp. 229 42401 2001. 
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Both of these algebras generalize the class of cographs, described in [2] for 
example. Moreover, they are both closely related to the kind of graph grammar 
introduced in [^. (Essentially, they enable bottom-up construction correspond- 
ing to the top-down substitution mechanism of such a grammar.) The main 
difference between them is that in the latter, edges are not added when two 
graphs are united, but with a separate unary operator instead. Indeed, it turns 
out that a derivation according to one of these algebras easily can be turned 
into a derivation according to the other, with either no more labels (for the 
first algebra) or at most twice as many (for the second) [1^ . And for both alge- 
bras it is unknown whether a potential decomposition with at most k labels can 
be found in polynomial time, except for very small values of k. For cographs, 
which correspond exactly to one and at most two labels, respectively, decom- 
position algorithms follow easily from [3|. It has thereafter been shown that an 
NLC 2 -decomposition can be found in 0(n^ log n) time |H]. And a decomposition 
algorithm for the algebra in [H] with three labels has been outlined in |T]. 

Below is presented an algorithm that can find an NLC-decomposition of 
minimum width for any graph G. It works by building, according to the rules 
of the algebra, larger and larger candidates — graphs that are consistent with 
G in the sense that they can occur in derivations of it. Until G itself has been 
obtained, the number of labels used, k, is incremented whenever nothing more 
can be done with the current labels. Of course, the algorithm is not intended to 
be used for large values of k. A mechanism is provided for removing candidates 
that no longer contain any unique information. Yet, a problem with this exact 
algorithm is that it is hard to see how large the set of produced candidates may 
become. The algorithm has therefore been developed further, so that if G has 
NLC-width k, an NLC-decomposition of width at most k log n can be found in 
Q(^ 2 fc+i) Note that this decomposition can then be turned into a “clique- 

decomposition” of width at most 2 log n times the clique- width of G. 

The implications of this for dynamic programming will vary. Let us consider 
the c-Colorability problem, for instance. It is not hard to devise an algorithm 
that runs in 0(2^'^^n) time on an NLCfc-decomposition and determines if the 
corresponding graph has a c-coloring. (There are at most n union operations 
in which we combine and evaluate 2“^ possible color configurations for each 
operand. Relabeling operations require less work.) Thus, if k is O(logn), the 
running time will still be polynomial. 

Some additional notes on the relationship with other graph classes may be 
of interest. To begin with, bounded tree- width implies bounded NLC-width and 
clique- width m mb\ . In contrast, bounded NLC- width/clique- width does not 
in general imply bounded tree- width. However, for graphs without large complete 
bipartite subgraphs, it does P!- Finally, distance-hereditary graphs have clique- 
width at most 3, whereas unit interval graphs and permutation graphs have 
unbounded clique-width m 
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2 Basic Definitions 

With a labeled graph G is meant a graph with a labeling function, labc, mapping 
each vertex u to a positive integer. We say v is labeled with (or has the label) 
labc{v). The unlabeled graph one obtains by disregarding labc is denoted by 
unlab(G), and the vertex and edge sets of G are written V(G) and E(G). 

Two labeled graphs Gi and G2 are disjoint if V(Gi) fl V(G2) = 0 . Their 
disjoint union is then defined as the labeled graph G given by V(G) = V(Gi) U 
V(G2), E(G) = E(Gi) UE(G2), labciu) = labcM) for all u G V(Gi), and 
labciu) = laba^iu) for all u G V(G2). 

Definition 1 (Union). Let Gi and G2 be disjoint labeled graphs, and let S C 
Z+ X Z+. Then (Gi) X5 (G2) is the graph one obtains by forming the disjoint 
union of Gi and G2, and adding to that all edges {u,v} satisfying u G V(Gi), 
V G V(G2), and {laboiiu), labG 2 (v)) G S. See Fig. |T] 

Definition 2 (Relabeling). Let G be a labeled graph, and let i? be a partial 
mapping from Z+ to Z+. Then o^.(G) is the labeled graph G' defined by V(G') = 
V(G); E(G') = E(G); and for all u G V(G'), labciu) = R{labG{u)) if Zo&g(w) is 
in the domain of R, and labG'{u) = labG{u) otherwise. See Fig. [T] 
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Fig. 1. Union and relabeling. G 3 = (Gi) X{( 2 ,i)} (G 2 ) and G 4 = °{( 2 ,i)} (G 3 ). 



Definition 3 (NLCfc, NLCfc-term). An NLCfe-term D expresses a valid use 
of the above operators xg, with S C [fc]^, and o^, with R partial from [A:] to 
[fc], as well as the unary operator Ai, with AG [A:]. The latter produces a graph 
consisting of a single vertex labeled with i, and for our purposes, the operand 
must be the name of the vertex. G{D) denotes the labeled graph produced by 
D, and NLCfc is the class of (labeled) graphs produced by NLCfc-terms. An 
NLC-term is an NLCfe-term for some value of k. 

Definition 4 (NLCfc-decomposition). An NLCfc-decomposition of a graph 
G is an NLCfe-term D such that G = G{D) if G is labeled, or such that G = 
unlab(G(U)) if G is unlabeled. An NLC-decomposition is an NLCfe-decomposi- 
tion for some value of k. 

NLC-terms are best though of as binary trees. The width of such a term is 
the number of labels used in it, and the NLC-width of a graph is the minimum 
width among all its NLC-decompositions. 
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For a graph G and a set of vertices V, G\V denotes the induced subgraph 
of G that remains when vertices in V have been removed. Similarly, for an NLC- 
term D, D \V denotes the subterm of D we obtain by deleting all subterms 
corresponding to single vertices in V, and removing also all operations that 
become superfluous. Evidently, G{D \ E) = G{D) \ V. In the following, we 
simply say that we remove vertices from a term or decomposition. 

3 An Exact Algorithm 

In this section we investigate how one can find an NLC-decomposition of min- 
imum width for a given graph. For simplicity, all graphs are assumed to be la- 
beled. (This is no restriction.) We first characterize those graphs that can occur 
in derivations of the input graph, and then take a brief look at the algorithmic 
idea. 

Let u, V, and w be vertices in a graph G. We say that w distinguishes u from 
u in G if it has an edge in this graph to one of u and v, but not to the other. 

Definition 5 (NLCfc-candidate). An NLCfc-candidate for a graph G is a la- 
beled graph G satisfying (i) V(G) C V(G); (ii) the edges of G are those induced 
by G; (iii) for any pair of vertices u and u in G with labc{u) = labc{v), we have 
(a) labciu) = labciv), and (b) no vertex in V(G) \ V(G) distinguishes u from 
V in G; and (iv) G G NLCfc. An NLC-candidate (or candidate, simply) is an 
NLCfc-candidate for some value of k. 

Specification 1 (Basic idea). To NLC-decompose a graph G, we are going 
to initialize a candidate set to the single vertices in G, each labeled with 1, and 
then add repeatedly to this, NLC-candidates for G that can be formed (using 
union and relabeling) from those already in the set (that is, from copies of these). 
More precisely, we will add NLCi-candidates in a first stage until this leads no 
further, then NLC 2 -candidates in stage two, and so on, and we shall record how 
each new candidate was formed. Thereby, we will find a decomposition of G of 
minimum width. 

For this approach to be successful, we must be able to keep down the size 
of the candidate set. We shall first consider the need for differing candidates 
on the same vertex set. For V C V(G), let Pg(E) denote that partition of V 
in which two vertices belong to the same class if and only if (i) they have the 
same label in G, and (ii) they are not distinguished by any vertex in V(G) \ V. 
Also, let a vertex class of a labeled graph be a maximal, uniformly labeled set 
of its vertices. Then, if two vertex classes in a candidate G for G are subsets of 
the same class in Pg(V(G)), one of these vertex classes can in fact be merged 
into the other (by a relabeling) without affecting the role of G in a derivation 
of G. We shall make use of this. Let the dimension of G, dim G, be the number 
of vertex classes it has. The candidate G is now said to be label-compact with 
respect to G if (i) the vertex classes of G are equal to the classes of Pg(V(G)), 
and (ii) the range of labc is {1, ■ ■ ■ ,dimG}. 
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Specification 2 (Union step). When we try to form a new candidate through 
union of two old ones, we shall allow relabeling of these as part of a union step. 
Hence, for a particular vertex set, we need to save at most one candidate, and 
we will choose this to be label-compact. Thus, we let each union step end with 
a relabeling as well. The width of a union step is the number of labels involved. 
(Naturally, we use no more than necessary.) 

We shall now inquire whether a candidate can replace smaller candidates 
that it “includes”. 

Definition 6 (Extension). Let Ci and C2 be NLC-candidates for a graph G. 
If Cl is an induced subgraph of C2 such that each vertex class of C2 contains, as 
a subset, one vertex class of Ci, then C2 is an immediate extension of Ci with 
respect to G. More generally, an extension of Ci with respect to G is any NLC- 
candidate for G that can be obtained by a relabeling of an immediate extension 
of Cl. 



Lemma 1. If G\ and C 2 are NLC-candidates for a graph G, then C 2 is an 
extension of G\ if and only if (i) each vertex class of Ci is a subset of some 
vertex class 0/C2, and (ii) each vertex class 0/C2 contains, as a subset, at least 
one vertex class of G\. 

Lemma 2. Let G\ be an NLC-candidate for a graph G. If C 2 is an extension 
ofGi and C3 is an extension 0/C2 (in both cases with respect to G), then C3 is 
also an extension of Ci . 

Lemma 3. Let Ci and C 2 be NLC-candidates for a graph G such that C 2 is 
an arbitrary extension of Gi, let D\ and D2 be NLC- decompositions of C\ and 
G2, and let D be an NLC-decomposition of G containing D\ as a subterm. Then 
there exists a mapping R2, such that we get a new NLC-decomposition D' of G 
by replacing Di by oj^.^(D2). (We first remove each vertex in V(C2)\V(Ci) from 
D.) 

Lemma [3] suggests that when we add a candidate C to the candidate set 
S, we may be able to remove from S any candidate which C is an extension of. 
This is not self-evident, though. Suppose that a decomposition hinges on a union 
step involving two particular graphs. What if S contains not both of these, but 
extensions of them which happen to overlap? Obviously, we must consider what 
we can do with nondisjoint candidates. If we extend the union step as described 
next, the suggestion above will work fine. 

Specification 3 (Union step, extended application). Two nondisjoint 
NLC-candidates Ci and C2 for a graph G are separable if either C2 \ V(Ci) 
or Cl \ V(C2) is also an NLC-candidate for G. When this is the case, we can get 
an NLC-candidate for G on the vertex set V(Ci) U V(C2) through a union step, 
either for Ci and C2 \ V(Ci), or for C2 and Ci \ V(C2). We will call this a union 
step for Cl and C2. Naturally, if there is a choice, we minimize the width of this 
underlying operation. 
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Specification 4 (Replacement rule). Each time we have produced a new 
candidate C through a union step for two members of the candidate set S, we 
shall do as follows: If S already contains an extension of C, then we discard 
C immediately. Otherwise we remove from S each candidate which C is an 
extension of, and we then add C instead. 

We need to prove that the algorithm is still correct. If we generalize Lemma E] 
first, such that D there can be a decomposition of any NLC-candidate for G, we 
can then show: 

Theorem 1. For each NLCfe-candzdaie C for the input graph G, the candidate 
sets contains an extension of C at the end of stage k. 

Proof. Induction over candidate size. Sketch of the induction step: We have 
C = Ofl;((Gi) Xs (G2)) for some NLCfc-candidates Gi and G2. Assume that S 
contains an extension G( of C\ . Use the generalization of Lemma|3] in particular, 
note that G2 = G2 \ V(G() will be an NLC-candidate. It also belongs to NLC^, 
as C2 does. Thus, assume also that S contains an extension C'f of G^. Use the 
generalized lemma again; this time, note that C'( = G( \ V(G2 ) will be an NLC- 
candidate. Hence, C[ and C'f are separable. Conclude, moreover, that a union 
step for them has width at most k and produces an extension of G. □ 

The replacement rule does not by itself guarantee efficiency of the algorithm. 
Perhaps though, if we prioritize union steps that make the replacement rule come 
into effect, the algorithm will run in polynomial time up to any fixed stage. With 
the approximate algorithm in the next section, we will achieve this performance 
for sure. As a preparation for the presentation there, let us look now at some 
implementation issues which pertain to the exact algorithm also. 

During the course of the algorithm, we need to find the vertex classes (in- 
cluding labels) of candidates. We call this surveying. At the end, we also need to 
reconstruct the decomposition of the input graph. In order to be space-efficient, 
we probably want to represent candidates mainly with their derivations, and 
to survey them by traversing these derivations to some reasonable extent. At 
a union step for two disjoint candidates, we then make full use of their exist- 
ing representations; if G is defined as o/j((o^^ (Gi)) xg (o/j2(G2))), we create a 
node for G containing the parameters R, R\, R2, and S, as well as references 
to Cl and G2. We can then survey G by surveying Gi and G2. However, if G is 
obtained in a union step for two nondisjoint (but separable) candidates Gi and 
G2, it is not enough to refer to both of these, as such a practice could lead to 
undesirably long surveying times. Instead, if Gi, for instance, is the intact part 
in the underlying disjoint union step, we may keep, with the node for G, also a 
list of the vertex classes in G2 \ V(Gi), so that we can survey G by surveying Gi 
and this list. This will allow us to survey a candidate in 0 {kn) time, where k is 
the stage of the algorithm and n = |V(G)|, either by merging vertex lists in a 
bottom-up way, or by composing relabeling operators from the top and down. 

One should note, of course, that when a candidate G is removed from the 
candidate set, it will not always be possible to delete its representation, as this 
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may still be referred to by other candidates. However, these can easily be kept 
track of in a list belonging to C. If this list becomes empty, the node for C can 
be deleted. 

Once the vertex classes are known for two candidates C\ and C2, the (min- 
imum) width, as well as the parameters, of a potential union step for Ci and 
C2 can be determined in 0 {kn) time, where k is the number of labels in either 
candidate, and n = |V(G)|. We say that we match C\ and G2. 

4 A Polynomial-Time Approximative Algorithm 

4.1 Abstract Derivations 

To get a better grip of the candidate set, we shall now see what we can do when 
nondisjoint candidates are not separable. 

Definition 7 (Abstract union step). Let C\ and C2 be label-compact NLC- 
candidates for a graph G, and let V* = V(Ci) UV(C2). An abstract union step 
for Cl and C2 with respect to G produces a label-compact NLC-candidate C for 
G on the vertex set V*. The nominal width of this abstract union step is the 
maximum of the dimensions of Ci, C2, and C. 

To distinguish between union steps as defined previously, and abstract union 
steps, the former will sometimes be called concrete, and their widths actual. 
Hence, the derivations we have been concerned with so far are concrete union 
step derivations, and the width of such a derivation is the highest actual width 
among its union steps. An abstract union step derivation will now be a derivation 
consisting of abstract union steps; its nominal width is the highest nominal 
width among these. Notice that a concrete union step always can be regarded 
as abstract. 

An abstract union step producing a candidate C from candidates Ci and C2 
may be represented by a node for C with the usual parameters R, R\, R2, and 
S, and links to C\ and G2. As in the nondisjoint but separable case, we also need 
a specific representation of either C\ \ ¥(^2) or C2 \ V(Gi) in order to allow for 
efficient surveying (the same method and complexity as before). Note that in 
general, S can make sense only for edges between C\ \ ¥(^2) and C2 \ V(Gi). 
We will not need S though; below, when we transform an abstract derivation 
into a concrete derivation, we will look up the edges in G instead. The value of 
an abstract union step derivation is now made clear by the next theorem. 

Theorem 2 . Let G be a graph with n vertices, and let D be an abstract union 
step derivation of G with nominal width k. Then there exists a concrete union 
step derivation (and thus an NLC-decomposition) ofG with width at most klogn. 

Proof (abridged). Think of D as a tree; for a node x, let V(x) denote the vertex 
set in G corresponding to all the leaves under x. Transform D as follows: for 
each node from the root and downwards, first exchange, if necessary, the left 
and right child I and r, so that |V(I)| > |V(r)|, and then remove from the right 
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subtree each leaf whose corresponding vertex is already represented in the left 
subtree. (In practice, we have to avoid handling all the multiple leaves of D. 
By surveying as previously described, and passing down information on which 
vertices of G are valid for each child subtree, we can transform D in 0{kii?) 
time.) 

Informally, each node x in D now corresponds to an induced subgraph C 
of the NLC-candidate C it formerly corresponded to. Importantly, if y is a leaf 
of D located to the right of the leaves under x, then y corresponds to a vertex 
which is not in V((7), and which cannot distinguish between vertices within a 
vertex class of C . Observe that on the path from the root of D down to any leaf 
y, there hangs at most logn subtrees to the left. Each of these corresponds to 
a graph with at most k vertex classes, and within each such class, vertices are 
uniformly labeled in G, as well as indistinguishable to any vertex whose leaf is 
y, or lies to the right of y in D. Hence, a label-compact NLC-candidate on the 
vertex set corresponding to the leaves to the left of y can have at most k log n 
vertex classes. As y was chosen arbitrarily, it immediately follows that we need 
at most k log n -I- 1 labels for a concrete derivation of G that adds one vertex at 
a time in the order given by the leaves of D. With a more detailed argument one 
can show that fclogn labels is in fact enough. One way (though not the fastest) 
to find the parameters of such a derivation is to perform an ordinary match for 
one vertex at a time, together with the NLC-candidate already obtained from 
the previous vertices. This takes a total of 0{kn^ logn) time. □ 

Specification 5 (Approximation algorithm). We extend the exact algo- 
rithm by performing, at each stage k, also abstract union steps of nominal width 
at most k. 

The approximation algorithm will give us an abstract union step derivation of 
minimal nominal width. We shall not prove this though, since, for our purposes, 
it suffices to note that Theorem [1] still holds; in the proof of that theorem, there 
is in fact no requirement that the extensions C[ and G 2 themselves belong to 
NLCfc. Thus, if the input graph G has NLC-width k, we will find an abstract 
derivation of G with nominal width at most k before the end of stage k. To 
reach this point quickly, we shall prioritize certain union steps that make the 
replacement rule come into effect. A new concept will help us then: 

Definition 8 (Representative). A set of vertices i? is a representative for an 
NLC-candidate C if i? contains at least one vertex from each vertex class in 
C . Regarded as a representative, R is introduced when, for the first time, the 
candidate set is augmented with a candidate which R can represent. 

It follows easily from Lemma [H that a representative for a candidate G also 
is a representative for any extension of C. Hence, once a representative has been 
introduced, it will (that is, can) always represent some member of the candidate 
set. We shall now identify a particular situation in which the abstract union 
step will be of great importance. We then follow up with an observation which 
will prove useful. Together, the two lemmas reveal very clearly the nature of the 
algorithm below. 
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Lemma 4. Let C\ and C2, of the dimensions di and c?2 respeetively, be label- 
compact NLC- candidates for a graph G, and suppose that they have a represen- 
tative R in common. Then an abstract union step for C\ and C2 with respect 
to G has nominal width max{di, ^2}, and the outcome G of such a step is an 
extension of both G± and G2 ■ 

Lemma 5. Let G be an NLC-candidate for a graph G, and assume that the 
candidate set S does not contain any extension of G. Then, for any k satisfying 
dimC < k < |V(C')|, either G, if now added to S, introduces some representative 
of size k, or else G has common representatives of size k with at least two 
candidates in S. 

4.2 The Abstract Derivation Algorithm 

Linksearch. In order to keep down the number of candidates, we are going to 
“apply” Lemma |4] whenever possible, so that the members of the candidate set 
S do not have representatives in common. Before we go into the details of this, 
we need to look at a data structure that we will use to speed up the algorithm. 
We shall keep, at stage fc, a table of all representatives of size at most k. With 
registering a representative i? to a candidate C, we mean marking in the table 
that i? is a representative for G. We register R at most once. Simply stated, 
when we remove a candidate G from S, we keep the node for C with a link to 
the replacing candidate (which has more vertices). Once R has been registered, 
we will be able to find its current member of 5 by a linksearch in 0 {n) time. 



Inspection, Sweep. When a candidate G has been formed, the first thing we 
want to find out is whether S already contains a candidate which is an extension 
of G. It is the main purpose of the table of representatives to assist us in this. If 
the table contained all representatives introduced so far, things would be rather 
simple. However, it may be costly to make sure that the table is always “up to 
date” . The inspection procedure for a new-formed candidate G is therefore a 
little complicated: 

We begin to go through the representatives for C. As long as they are not 
registered, we register them to G. If we find one which is already registered, 
we follow the links to its current candidate C". If this is an extension of G 
(case I) , we are not going to add C to S. Yet, if we have registered representatives 
to C, we keep the node for G, with a link to G' , so that we do not lose the 
new registrations. The inspection is then over. If C' is not an extension of G 
(case 2), no other member of S could be so either (such a candidate would 
have a representative in common with G'), so fundamentally, G should now 
be added to S. However, as we do not want members of S to have common 
representatives, we are first going to apply Lemma [Has long as possible, before 
the resulting extension of G should finally be considered “added” to S. To find 
and carry out these Lemma S] applications, we are mainly going to rely on the 
sweep procedure, defined below. First though, we might as well try to exploit 
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the table of representatives once more. To begin with, we can remove C from 
S and replace C with an extension of both. (The details are the same as in a 
sweep.) Now, as the label-compact C" is not an extension of C, C contains a 
vertex outside C . Hence, C (and thus the common extension too) also has one 
or more representatives which are not representatives for C . We can easily find 
at least one; if it is not registered, we register it; if it already is, we follow the 
links and apply Lemma[3 again. After this, it might no longer be useful to look 
in the table, so instead we send the most recent extension of C to the sweep 
procedure, which will find and take care of any remaining applications of the 
lemma. The inspection is then over. In summary, we have seen what to do if 
we find, via the table, a member of S sharing a representative with C. There 
remains the possibility that we find no previously registered representative for 
C (case 3). The table does then not tell us what to do with this candidate. This 
too we resolve with a call to the sweep procedure. 

In general, a sweep for a candidate C goes as follows: We compare C with 
the members of S until we find one, C , sharing a representative (of any size) 
with C. If C is an extension of C, we link C to C", and the sweep is over. If C is 
an extension of C , we link C to C, remove C from 5, and continue to compare 
C with the remaining members of S. Otherwise, we let C” be the outcome of a 
concrete or abstract union step for C and C"; we link both of these to C", we 
remove C from 5, and we terminate the current sweep, starting instead a new 
sweep for C” . If, in this way, we reach the end of S (such that we would have 
continued with another member, if there was one), we add C to S. 



The Main Procedure. Now that we have looked at the inspection procedure, 
we are ready for a description of the entire abstract derivation algorithm. To be 
able to match candidates in a systematic way, we will keep the candidate set S 
divided into two parts: a core set, whose members have all been matched against 
each other, and a waiting list containing remaining candidates. 

We begin stage k by extending the table of representatives to encompass 
elements of size k also. For each candidate in S, we register all representatives 
of this size. As no two members of S have a common representative of any size, 
this registration is cheap, and it will probably pay off. We then empty the core 
set and put all the candidates of S in the waiting list. We are now ready to begin 
the main loop of stage k: 

As long as there are candidates in the waiting list, we take one, C, and 
we match this against each member of the core set. If we find that a concrete 
or abstract union step can be performed with at most k labels, we stop for a 
moment, and we send the outcome of this to the inspection procedure. If the 
result of that is that a new candidate should be added to S, we place this in 
the waiting list. By now, C may have been removed from S. If so, we go on to 
the next candidate in the waiting list. Otherwise, we return to C and the next 
remaining member of the core set. If the entire core set has been matched with 
C already, C itself is added there, and we continue with another candidate in 
the waiting list. When this is empty, stage k is over. 
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Correctness and Analysis. To see that the abstract derivation algorithm is 
correct, just note that it implements Specifications |T] through |5] (The fact that 
a new candidate is not considered added to the candidate set until it has passed 
the inspection procedure does not change the process itself.) 

We shall now analyze the abstract derivation algorithm. We will arrive at the 
following: 

Theorem 3. The abstract derivation algorithm runs in time and 

0(n^+^) space up to stage k. 

We can observe first that there are (T) representatives of size at 

most k. So obviously, the registration operations make up a diminishing part of 
the overall time consumption, and we shall therefore disregard them below. 

In the sweep procedure, we compare two candidates directly in order to see 
if they have a common representative. If they have, we also determine if one 
of them is an extension of the other. We shall refer to either of these tasks as 
a comparison. Strictly speaking, a comparison can be conducted in 0{n) time. 
However, we naturally want to include the time to survey the candidates in 
question. We shall allow the latter to be O(fcn), as this will not impair the total 
result. Thus, both comparison and match will be 0(fcn)-time tasks. 

Let us also observe that a match often is followed by a linksearch, which in 
its turn is generally followed by a comparison. We shall attribute the cost of all 
of this to the match. As each linksearch is caused by “its own” match, we need 
not mention linksearches and their associated comparisons again. 

Finally, we note that a union step can be stored in 0{n) time and space. 
Theorem |3] now follows from the below lemma, which can be proved from an 
adaptation of Lemma [S) 

Lemma 6. The main procedure performs at most Ak^ + matches up 

to stage k. At most 2Ak of these add a candidate to S. In connection with them, 
the inspection procedure may perform another matches. Hence, at most 4Afc 
union steps may have to be stored. In addition, up to Ak^ comparisons may be 
performed by the sweep procedure. 



4.3 Conclusion 

For any given graph G of NLC-width at most k, the above algorithm gives us 
an abstract union step derivation of nominal width k or less in time 

and space. Certainly within these resource bounds, we can convert this 

derivation to an NLCfciogn-decomposition as in the proof of Theorem |2] 
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Abstract. We introduce two subfamilies of AT-free graphs, namely, 
path orderable graphs and strong asteroid free graphs. Path orderable 
graphs are defined by a linear ordering of the vertices that is a natural 
generalization of the ordering that characterizes cocomparability graphs. 
On the other hand, motivation for the definition of strong asteroid free 
graphs comes from the fundamental work of Gallai on comparability 
graphs. We show that cocomparability graphs C path orderable graphs 
C strong asteroid free graphs C AT-free graphs. In addition, we settle the 
recognition question for the two new classes by proving that recognizing 
path orderable graphs is NP-complete whereas the recognition problem 
for strong asteroid free graphs can be solved in polynomial time. 



1 Introduction 

We say that a vertex in a graph G = (V, E) intercepts a path in G if it is adjacent 
to at least one vertex of the path, and it misses the path otherwise. An asteroidal 
triple {AT for short) is an independent set of three vertices such that, between 
every pair, there is a path that is missed by the third. A graph is AT-free if it 
does not contain an AT. 

One of the most compelling motivations for the study of AT-free graphs 
is the idea that these graphs exhibit a type of linear structure. Indeed, the 
linear structure exhibited by AT-free graphs is explained, in part, in [1], where 
it is shown that every connected AT-free graph contains a dominating pair (two 
vertices such that every path connecting them is a dominating set), and a type 
of linear “shelling sequence” called a spine. 

The original motivation for the results of the present paper was the idea 
that AT-free graphs might be further characterized by the existence of a vertex 
ordering satisfying certain conditions. 

Vertex orderings have proven to be useful algorithmic tools for several families 
of graphs. For example, chordal graphs (respectively, cocomparability graphs) 
are characterized by the existence of vertex orderings that do not contain the 
forbidden ordered configuration shown in Figure [T] (a) [2] (respectively, (b) [S]). 
A graph is an interval graph if and only if it has a vertex ordering that contains 
neither of the configurations of Figure [T] (see for example [ID])- Such vertex 
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orderings are referred to as chordal orderings, cocomparability orderings, and 
interval orderings, respectively. 



(a) 



(b) 



Fig. 1. Forbidden ordered configurations. 



In other words, in an interval ordering, for every path on two vertices (that 
is, for every edge), the left endpoint of the path is adjacent to all vertices be- 
tween the two endpoints of the path. In a cocomparability ordering, each vertex 
between the two endpoints of a P2 is adjacent to one or both endpoints of the 

P 2 - 

It is well-known that interval graphs are exactly those graphs that are both 
chordal and cocomparability [5j or, equivalently, both chordal and AT-free [^. 
Furthermore, cocomparability graphs form a proper subclass of AT-free graphs 

m- 

An alternate characterization of the cocomparability ordering is given in 
Observation m 

Observation 1. A vertex ordering vi,...,Vn of graph G is a eocomparahility 
ordering if and only if Vi,Vj,Vk with i < j < k, vertex Vj intercepts each 
Vi, Vk -path ofG. 

From this, one can easily see that a cocomparability graph must be AT- 
free since any independent triple occurs in some order, say x < y < z, in & 
cocomparability ordering, and thus, there cannot exist an x, z-path missed by y. 

In an attempt to generalize the cocomparability ordering while retaining the 
AT-free property, we introduce the following definition. 

Definition 1. A graph G = (V, E) is path orderable if there is an ordering of 
the vertices v\, . . . ,Vn such that for each triple of vertices Vi, Vj,Vk with i < j < 
k and ViVk ^ E, vertex Vj intercepts each Vi,Vk~path of G; such an ordering 
vi, ... ,Vn of V is called a path ordering. 

It is easily seen by Observation [U and Definition [U that cocomparability 
graphs are all path orderable. C5, the chordless cycle on five vertices, is a path 
orderable graph which is not a cocomparability graph. It is clear that path 
orderable graphs must be AT-free; however, while it was hoped that Definition [T] 
might characterize AT-free graphs, we shall see later that path orderable graphs 
form a strict subset of AT-free graphs; as an example see the graph in Figure H] 
It is obviously AT-free, however it has no path ordering, as will be shown later. 

Nevertheless, since path orderable graphs are interesting in their own right, 
we attempted to provide a structural characterization of this graph class by 
identifying a type of forcing relation on the nonadjacent pairs of vertices and 
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Fig. 2. AT-free graph G which is not path orderable. 



the type of structure which would make the vertex ordering of Definition [T] 
impossible. 

These investigations follow in Gallai’s footsteps in that they involve ideas 
similar to his forcing relation on the edges of a comparability graph (equivalently, 
the nonedges of a cocomparability graph) and his definitions of wreaths and 
asteroids. Specifically, we define strong asteroids and show that path orderable 
graphs are strong asteroid free. However, it turns out that the strong asteroid 
concept does not provide a characterization of path orderable graphs; we shall 
see that path orderable graphs form a proper subclass of strong asteroid free 
graphs which, in turn, form a proper subclass of AT-free graphs. 

Thus, we will identify two distinct subclasses of AT-free graphs, both of which 
contain cocomparability graphs: 

cocomparability C path orderable C strong asteroid free C AT-free 

The interest lies, in part, in the natural vertex ordering, in one case, and the 
relationship with Gallai’s work, in the other case. Furthermore, the identification 
of these graph classes should allow us to narrow the gap between known polyno- 
mial and known NP-complete behaviour of problems in the domain of AT-free 
graphs. 

We conclude the paper with a proof that the recognition of path orderable 
graphs is NP-complete, and a polynomial time recognition algorithm for strong 
asteroid free graphs. We note that the NP-completeness result settles an open 
problem stated in m- 

2 Background 

In his paper on comparability graphs, Gallai studies the forcing between the 
edges imposed by a transitive orientation (to avoid misunderstandings, from 
now on we will refer to the transitive- forcing as t-forcing). Let G be a (not 
necessarily comparability) graph. Two edges which share a common endpoint 
and whose other endpoints are nonadjacent t-force each other directly. That 
is, in any transitive orientation, either both edges are directed away from the 
common endpoint, or both are directed towards it. The transitive closure of the 
direct t-forcing relation partitions the edges of a graph into its t-forcing- classes. 
Either there are exactly two different transitive orientations of the edges of a t- 
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forcing-class, or there are none. The latter case occurs when some edge is t-forced 
in both directions, in which case the graph is not a comparability graph. Edges xy 
and xz are said to be knotted if y and z are connected in N{x), the complement 
of the subgraph of G induced by N(a:), where N(x), the neighbourhood of x, is 
defined by N{x) = {u\ux £ E}. 

To capture the t-forcing in a given graph G, Gallai uses the concept of a 
knotting graph: For a graph G = (V,E) the corresponding knotting graph is 
given by K[G] = {Vk,Ex) where Vk and E^: are defined as follows. For each 
vertex v of G there are copies vi,V 2 , ■ ■ ■ ,Vi^ in Vk, where iy is the number of 
connected components of N(w), the complement of the graph induced by N(u). 
For each edge vw of E there is an edge ViWj in Ek, where w is contained in 
the connected component of N(w) and v is contained in the j**' connected 
component of N(ru). 

In this graph two edges are incident if they are knotted. The edges of the 
t-forcing classes of G are given by the connected components of K[G]. Using this 
structure, Gallai shows that a graph G is a comparability graph if and only if 
K[G] is bipartite. 

The following definitions describe structures which lead to t-forcing classes 
which cannot be transitively oriented, and knotting graphs which are not bipar- 
tite. 

Definition 2. An odd wreath of size k in a graph is a cycle of knotted edges, 
specifically, a sequence of vertices vo,vi,V 2 , ■ ■ ■ ,Vk where k is odd, v\, . . . ,Vk are 
distinct, vq = Vk, and \/0 < i < k, edges ViVi+i and Vi+\Vi +2 exist in the graph 
and are knotted (addition modulo k). 

Definition 3. An odd asteroid of size k in a graph is a sequence of vertices 
vo,vi,V 2 , ■ ■ ■ ,Vk where k is odd, v\, ... ,Vk are distinct, vq = Vk, and VO < i < fc, 
3 a ViVi+i-path in G which is missed by 

Gallai points out that an asteroid is the complement of a wreath, and proves 
that a graph is a comparability graph if and only if it contains no odd wreath 
or, equivalently, a graph is a cocomparability graph if and only if it contains no 
odd asteroid. 

3 Path Orderable Graphs 

and Strong Asteroid Free Graphs 

t-forcing is a fundamental concept for comparability graphs, and thus for cocom- 
parability graphs as well. Given the similarities of the linear ordering character- 
izations of path orderable graphs and cocomparability graphs, one might expect 
a similar forcing concept for path orderable graphs. In fact such is the case. 

For a graph G and a vertex u of G let Gi , . . . , Gfc be the connected components 
of G \ (N (u) U {v}) and let Bj,. .. ,Bf he the connected components of the graph 
induced by the vertices of Gi in G (1 < i < /c); the Bf are called the blobs of v 
in G. 
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Lemma 1. Let G be a path orderable graph and vi, . . . ,Vn a eorresponding path 
ordering. For every vertex v of G and every blob B of v, the vertices of B either 
occur all before v in the path ordering or all after v in the path ordering. 

Proof. Suppose there is a vertex v and a blob B ofv with u,w € B and u < v < w 
in the path ordering of G. By the definition of blobs, u and w are in the same 
connected component C of C?\N(u). Since u and w are also in the same connected 
component i? of C in G, there has to be a path of non-edges in B between u and 
w. Thus, there is a pair of vertices u', w' in B with u'w' ^ E and u' < v < w' . 
But u' ,w' G C; therefore there is a tt', ic'-path in G \ N(u), contradicting the 
path ordering. □ 

By Lemma [1] any path ordering has to fulfill the property that if one of the 
vertices u of a blob B oi v precedes v in the ordering, then all of the vertices 
of B have to be in front of v as well. As an example consider the graph in 
Figure E] Following the above definition of blobs, vertex 3 has the three blobs 
{6,7, 8,9}, {5}, {!}; vertex 7 has the blobs (3, 1,9}, {2}, {5}; vertex 8 has the 
blobs {3,5,6}, {4}, {1}; vertex 5 has only the blob {1, 2, 3, 6, 7, 8, 9} and vertex 
1 has only the blob {3, 4, 5, 6, 7, 8, 9}. Suppose now there is a path ordering of 
G. By Lemma [H we can, without loss of generality, assume that 1 is in front of 
all vertices of its blob and thus 5 appears after all vertices of its blob in the path 
ordering; in particular vertices 3, 6, 7, 8, 9 are between 1 and 5. Since 7 and 8 are 
in the same blob of 3, they either appear both before or both after 3 in the path 
ordering. However, if they both appear before 3, then, again by Lemma [T] we 
have a contradiction because 3 and 1 are in the same blob of 7, but on different 
sides in the path ordering. On the other hand, if both 7 and 8 appear after 3 in 
the path ordering we have again a contradiction, since 3 and 5 are in the same 
blob of 8 but on different sides in the path ordering. Hence there cannot be a 
path ordering for the graph in Figure [21 

When interpreting the constraints of Lemma [T] as orientations of the edges 
of G, in the sense that edges from the same blob of a vertex u to u in G have to 
have the same orientation, one can define the following forcing on the edge set 
of G. 

Let G be a graph (not necessarily path orderable) and let ei = uv, = vw 
be edges of G with a common end- vertex v. Then one can define a relation k, by 
Cl Ri 62 (ci and 62 force each other or are knotted at v) if and only if u and w 
are in the same blob of v (possibly u = w) in G. The consecutive application of 
this relation defines a class partition of the edges of G, where two edges e^, 
are in the same class (forcing class) of G if there is a sequence ei, 62, . . . , e/c of 
edges such that Cq = ei ~ 62 ~ ~ 6fc = e;,. Observe that the forcing classes 

are refinements of the t-forcing classes. 

An orientation of the edges of G is said to agree with the forcing if for any 
vertex v and any blob B oi v all edges between B and v are oriented in the same 
direction (either towards v or away from v). For a graph G a linear ordering 
ui , . . . , of the vertices of G is said to agree with the forcing if the corresponding 
implied orientation of the edges of G (uv is oriented from u to u if it < u in the 
linear ordering) agrees with the forcing. 
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Note that when the orientation of one of the edges of a forcing class is fixed, 
then the orientation of all the edges of its forcing class is determined; hence, 
either there are exactly two different orientations of the edges of a forcing class 
that agree with the forcing, or none. In the latter case, some edge is forced to 
be oriented in both directions, meaning that there is no ordering consistent with 
the forcing. 

Lemma 2. A graph G is path orderable if and only if there is a linear ordering 
of the vertices of G agreeing with the forcing. 

Proof. If G is path orderable, then, by Lemma[l] the path ordering has to agree 
with the forcing relation. 

Suppose there is a linear ordering of G, that agrees with the forcing relation 
and suppose there is a triple u < v < w of vertices that violates the path ordering 
property, i.e. uw ^ E and there is a u, w-path in G \ N(u). Hence, u and w are 
in the same connected component G of G \ N(u) and, since uw ^ E, u and w 
are also in the same blob B of v. But then this ordering does not agree with the 
forcing relation; contradiction. □ 



Corollary 1. A graph G is path orderable if and only if there is an acyclic 
orientation of G, agreeing with the forcing relation. 

Proof. Determine a topological ordering, using the acyclic orientation of G; then 
the corollary follows from Lemma El □ 

One can define a graph, similar to Gallai’s knotting graph, representing the 
forcing classes of G. For a graph G = (V, E) the altered knotting graph is given 
by K*[G] = {Vk, Ek) where Vk and Ek are defined as follows. For each vertex 
V of G there are copies v\, . . . in Vk, where is the number of blobs of v in 
G. For each edge vw of E there is an edge ViWj in Ek, where w is contained in 
the blob of u in G and v is contained in the blob of w in G. 




Fig. 3. A graph G together with K[G] and K*[G]. 



As Gallai did for the knotting graph, we draw the altered knotting graph 
K*[G] of a given graph G by putting different copies of the same vertex close 
together. 
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Our next task is to examine configurations which cannot occur in path or- 
derable graphs. In a step toward this goal, we define restricted types of wreaths 
and asteroids. 

Definition 4. An odd strong wreath of size k in a graph G is a sequence of 
vertices vq,vi, . . . ,Vk where k is odd, v\, ... ,Vk are distinct, vq = Vk, and VO < 
i < k, edges and Vi+iVi +2 exist in the graph and are knotted in the altered 

sense, that is, Vi and Vi +2 in the same blob of Vi+i in G (addition modulo 
k). 

Definition 5. An odd strong asteroid of size k in a graph G is a sequence 
of vertices VQ,v\,...,Vk where k is odd, v\,...,Vk are distinct, vq = Vk, and 
VO < i < k, Vi and Vi+i are in the same blob in G. 

The two notions are complementary, that is, a graph G has an odd strong 
wreath if and only if G contains an odd strong asteroid. Furthermore, strong 
asteroids and strong wreaths are restricted types of asteroids and wreaths. We 
also note that the asteroidal triples are the odd strong asteroids of size three. 

Definition 6. A graph G is strong asteroid free if it does not contain an odd 
strong asteroid. 

Similar to the t-forcing results, the following holds: 

Lemma 3. The forcing-classes of a graph G correspond exactly to the connected 
components o/K*[G]. 

The next two observations follow from the fact that an odd strong asteroid 
of size A: in G corresponds to an odd cycle of size k in K* [G] . 

Observation 2. A graph G is strong asteroid free if and only i/K*[G] is bipar- 
tite. 



Observation 3. A graph G is AT-free if and only i/K*[G] is triangle- free. 



Lemma 4. If a graph G is path orderable then K* [G] is bipartite. 

Proof. Let tii, . . . , r;„ be a path ordering of G. Now orient the edges of K*[G] as 
follows: ViVj is oriented from Vi to Vj if i < j. Now, by Lemma [3 each vertex of 
K*[G] has either only incoming or only outgoing edges. Hence, it is bipartite. □ 

Not only does the graph in Figure show that path orderable graphs are 
strictly contained in AT-free graphs, it also establishes that strong asteroid free 
graphs are strictly contained in AT-free graphs, as shown in the next lemma. 

Lemma 5. The class of strong asteroid free graphs is strictly contained in the 
class of AT-free graphs. 



248 



Ekkehard Kohler et al. 



Proof. Consider the graph of Figure [H It is easy to check that 1 3 5 7 8 is an 
odd strong asteroid in G, and that G is AT-free. □ 

In the case of comparability graphs, Gallai not only showed that the knotting 
graph K[G] of a comparability graph is bipartite, but also proved that a bipartite 
knotting graph K[G] is a sufficient condition for G being a comparability graph. 
The major tool that he used for proving this result is a lemma which shows the 
following. Given a bipartite knotting graph K[G] and consider a triangle of G 
with the property that at least two of the edges of the triangle are in the same 
t-forcing-class. Then in any orientation of G that agrees with the t-forcing the 
triangle is not oriented cyclically. 

It turns out that a similar lemma holds for strong asteroid free graphs, too. 
Specifically, for a graph G with a bipartite altered knotting graph K*[G], any 
orientation of G that agrees with the forcing relation does not contain a cyclically 
oriented triangle. However, contrary to the t-forcing relation, this lemma is not 
enough to imply that the orientation is acyclic and, indeed, we shall show that 
this is not necessarily the case. 

Observation 4. Given a vertex v in a graph H and vertices u,w G N(u), which 
are the endpoints of a P 4 in N(ri), then the edges uv and wv are forcing each 
other (see Figure^. 



V V 

Fig. 4. Vertex v with P4 in N(u) together with corresponding altered knotting graph 
K*[H]. 




Remark 1. Using this observation one can create a forcing path, i.e. a path P, 
where each consecutive pair of edges of P is knotted at the common end-vertex 
by the help of an added P4 as described in Observation [J — see Figure 0 (in the 
following, edges and vertices of the path P itself are called original edges/vertices, 
the added edges and vertices are denoted as auxiliary edges/vertices). By the 
forcing, the orientation of any original edge of P forces the orientation of all 
other original edges of P. Note that the knotting graph of a forcing path does 
not contain a triangle or any odd cycle. Furthermore, if P has even length then 
the end-edges of P are either both oriented towards the inner vertices of P or 
both oriented outwards from the inner vertices of P. Similarly, if P has odd 
length the end edges of P have opposite orientations with respect to the inner 
vertices of P. 

Theorem 5. The class of path orderable graphs is strictly contained in the class 
of strong asteroid free graphs. 
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Fig. 5. A forcing path of length 4 (original edges and vertices are bold). 



Proof. Consider the left graph in Figure El This graph is the complement of a 
strong asteroid free graph G. This is proved by constructing the altered knotting 
graph K*[G] (see the right graph in Figure E)- By Observation 2] the thick edges 
force each-other, as shown in the altered knotting graph; and, without having a 
strong asteroid in G, there is a forced oriented cycle on the vertices x\,. . . ,Xk 
in G. Consequently, by Corollary [T] G is not path orderable. This construction 
holds for any fc > 4. □ 





Fig. 6. Complement of a strong asteroid free graph, which is not path orderable (left), 
together with its altered knotting graph (right). To ease understanding of its structure, 
in the knotting graph only for one of the “arms” of the example the corresponding 
auxiliary P 4 vertices are drawn in the figure. One of the two possible forced orientations 
of the main forcing class is given in the right picture. 



4 Recognition of Path Orderable 
and Strong Asteroid Free Graphs 

In this section, we show that the recognition of path orderable graphs is NP- 
complete. This result answers a question which is posed by J. Spinrad in [12] . In 
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contrast, we describe how to recognize strong asteroid free graphs in polynomial 
time. 

If there is only one forcing class for the edge set of G one can check in 
polynomial time whether G is path orderable: Compute K*[G], check whether 
it is bipartite, assign an orientation to K* [G] by orienting all edges from one of 
the bipartition classes to the other and check whether this orientation is acyclic 
on G. 

Similarly one can check whether G is path orderable if the number of forcing 
classes of G is bounded by a constant. 

For comparability graphs Gallai’s results for the general case, i.e. where no 
assumption on the number of edge classes is made, leads to a polynomial time 
recognition algorithm. For this he introduced the (by now well-known) concept 
of modular decomposition and proved that, using this decomposition scheme, 
the problem of recognizing comparability graphs reduces to the problem of rec- 
ognizing prime comparability graphs. But what about the recognition of path 
orderable graphs? Can one extend the decomposition scheme to this problem? 

NOT-ALL-EQUAL 3SAT g 

INSTANCE: Set U of variables, collection C of clauses over U such that each 
clause c S C has |c| = 3. 

QUESTION: Is there a truth assignment A for U such that each clause in G has 
at least one true literal and at least one false literal? 

Remark 2. Without loss of generality one can assume that none of the clauses 
contains more than one literal of a variable. 



Theorem 6. Recognition of path orderable graphs is NP-complete. 

Due to space restrictions in this extended abstract we leave out the proof for 
this theorem but instead sketch the main ideas of the construction. 

The proof is done using a transformation from NOT-ALL-EQUAL 3SAT. 
Given an instance / of NAE 3SAT, a graph G is constructed, which is the 
complement of a path orderable graph if and only if I is NAE 3 SAT-satisfiable. 
In particular it is shown that I is NAE 3 SAT-satisfiable if and only if there is 
an acyclic orientation of G that agrees with the forcing. By Corollary [1] this is 
equivalent with G being path orderable. 

The basic construction of G is done as follows. For every variable x oiU an 
edge Cx is created (in the following called a variable edge) and the two possible 
orientations of Cx are associated with the two possible values true and false of 

X. 

For each clause G = x\/ yW z with literals x,y,z a gadget is constructed, 
mainly consisting of two G4S as shown in FigureH In each of the G4S three of the 
edges (the base edges) correspond to the three literals x,y, z of G; a true literal of 
G corresponds to a clockwise orientation of the corresponding base edges in both 
of the G4S, whereas a false literal corresponds to a counter-clockwise orientation 
of the corresponding base edges in both G4S. Furthermore, in each orientation 
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Fig. 7. Gadget for clauses. 



that agrees with the forcing, the fourth edges of the two C4S will be guaranteed 
to have opposite orientations in the two C4S. 




Fig. 8. General structure of K* [G] for the instance I = {x\ V *2 V 2:3) A {x2 V 2:3 V xa) A 
(irl V 2:3 V Iff) (auxiliary vertices and edges are omitted). 



The general structure of the connection between variable edges and base 
edges by forcing paths is shown in Figure E] for easier understanding, the aux- 
iliary edges and vertices of the forcing paths are omitted in this picture. For a 
variable edge ex (see the top of Figure [HI) a downwards orientation corresponds 
to assigning false to variable x whereas an upwards orientation corresponds to 
assigning true to x. For each literal x or x, there is a forcing path of length 
4, having e^, and the corresponding base edge as its end-edges; depending on 
whether the literal is x or x, either the start- or the end-vertex of the base edge 
(with respect to a clockwise ordering in the C 4 ) is made the end-vertex of the 
forcing path. 
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Now, by Remark [T] assigning an upwards orientation to the variable edge Cx 
results in the desired clockwise orientation of the base edges of each instance 
of the literal x and a counter clock-wise orientation of the base edges of each 
instance of the literal x for any orientation agreeing with the forcing. 

All we have to show now is that there is an acyclic orientation of G agree- 
ing with the forcing relation if and only if C has a NOT- ALL-EQUAL 3SAT 
satisfying assignment. 

By the forcing of the edges and the appropriate knotting of the forcing path 
from the variable representing edges to the edges representing the literals, each 
true literal in a clause C leads to a clockwise oriented edge and analogously, 
each false literal implies a counter-clockwise oriented edge in the correspond- 
ing C4S. Since every clause has at least one true and one false literal, each of 
the C4S has both an edge that is oriented clockwise and one that is oriented 
counter-clockwise. Hence, none of the C4S is cyclically oriented and by the above 
observations that only the C 4 S are possible candidates for oriented cycles, the 
orientation is acyclic. 

Suppose now that there is an acyclic orientation of G that agrees with the 
forcing relation. We assign to a variable x of U the value true, if the edge rep- 
resenting variable x (edges on top of Figure |SJ is oriented upwards and false 
otherwise. Since the orientation agrees with the forcing relation, all we have to 
show is that all of the clauses have at least one true and one false literal. Sup- 
pose there is a clause C, which has only true (false) literals. By the definition 
of G and the forcing relation, three edges in each of the C4S in C”s gadget are 
oriented counter-clockwise (clockwise). Since the “fourth-edges” have opposite 
orientations in the two C4S of G, exactly one of the C4S is oriented cyclically, 
contradicting the orientation of G being acyclic. 

A polynomial time recognition algorithm for strong asteroid free graphs fol- 
lows from Observation 12 Given graph G, the altered knotting graph of G, K*[G], 
can be computed in polynomial time: for each vertex v of G, the blobs of v in 
G can be computed in O(n^) time; each vertex has fewer than n blobs. Thus, 
K*[G] has O(n^) vertices and O(n^) edges (since each edge of G corresponds 
to exactly one edge of K*[G]), and can be constructed in O(n^) time. To test 
whether K*[G] is bipartite can be done in O(n^) time. Overall, the recognition 
algorithm requires O(n^) time. 

5 Concluding Remarks 

We have defined two graph classes and shown that cocomparability graphs C 
path orderable graphs C strong asteroid free graphs C AT-free graphs. Further- 
more, we have shown that the recognition problem for path orderable graphs is 
NP-complete, and the recognition of strong asteroid free graphs can be solved 
in polynomial time. We note that AT-free graph recognition is also in P |ll7j . 

Although it is somewhat disappointing that no two of these families are 
equivalent, these classes may give insight into some open problem complexities 
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on AT-free graphs. By adding graph classes in the hierarchy between cocompara- 
bility graphs and AT-free graphs, we may be able to identify more precisely the 
boundary between polynomial and NP-complete behaviour of some of the prob- 
lems which are known to be polynomially solvable on cocomparability graphs 
but either NP-complete or unresolved on AT-free graphs. Examples of such prob- 
lems include graph colouring, clique cover, clique, and the Hamiltonian path and 
cycle problems. One step in this direction is the observation that the clique prob- 
lem is NP-complete for path orderable graphs. This follows from the facts that 
the complements of triangle-free graphs are contained in path orderable graphs, 
and the independent set problem is known to be NP-complete on triangle-free 
graphs m 
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Abstract. We give a complete characterization of parameter graphs H 
for which the problem of coloring iJ-free graphs is polynomial and for 
which it is NP-complete. We further initiate a study of this problem for 
two forbidden subgraphs. 



1 Preliminaries and Overview of Results 

Graph coloring belongs to the most important and applied graph problems. It 
also belongs to the first identified NP-complete problems. Many classes of graphs 
were shown to allow polynomial-time solution (e.g., interval graphs, chordal 
graphs, etc.). In this paper we aim at classifying the computational complexity 
of this problem when restricted to graphs that do not contain certain forbidden 
induced subgraphs. Related results appear in [8], where 3-colorability is studied. 
We consider, on the other hand, the coloring problem with the number of colors 
being part of the input. For one forbidden subgraph we obtain a complete charac- 
terization of the complexity, which performs the polynomial-time/NP-complete 
dichotomy. First results in the direction of two forbidden subgraphs are gathered 
in the last section, but a complete characterization is not yet at hand. 

We consider finite simple undirected graphs. We say that a graph G is H- 
free, where H is another graph, if G does not contain an induced subgraph 
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isomorphic to H. Typically H will be a fixed “small” graph while G will be part 
of the input of the problem. For a class of graphs A, we say that G is A- free 
if it is H-fi'ee for every H G A. The graphs of A are then also referred to as 
forbidden induced subgraphs in the sense that A is the set of forbidden induced 
subgraphs for the class of ^-free graphs. 

As usual, Kn denotes the complete graph on n vertices, C„ denotes the cycle 
on n vertices and Pn denotes the path with n vertices (i.e., path of length n — 1). 
By G(B H we denote the disjoint union of G and H, and in this sense kG means 
the disjoint union of k copies of G. We use the notation G < H when G is 
isomorphic to an induced subgraph of H . The graph is a cycle of length 
k with a pending edge, e.g. G^ = P^^Ki. The complement of a graph G 
is denoted by G. The chromatic number of G is denoted by x(G). The clique 
covering number of G is denoted by cr(G) (cr(G) = x(G)). 

We study the computational complexity of the following problem: 

iJ-FREE Coloring 

Input: An iJ-free graph G, and an integer k. 

Question: Is x(G) < k? 

Our main result is summarized in the next theorem. 

Theorem 1. The problem iJ-FREE Coloring is polynomial-time solvable if H 
is an induced subgraph of P4 or of P3 © Ki, and NP-complete for any other H . 

We first discuss the polynomial part, then present a useful reduction which is 
easier to be formulated in the complementary version (i.e., as a result on clique 
covering), and complete the proof of the NP-completeness part in Section | 4 ] 
Some ideas towards sets of more forbidden graphs conclude the last section. A 
more detailed account on partial results concerning 2 forbidden induced sub- 
graphs will be given in the full version of the paper. 



2 Polynomial Cases 

All graphs H for which the FI-Free Coloring problem is polynomial-time solv- 
able are depicted in Fig.[TJ If H is an induced subgraph of H' then every H-iree 
graph is also H'-free, and hence IF-Free Coloring oc IF'-Free Coloring. 
Therefore it suffices to prove the polynomial part only for the maximal graphs 
from Fig. [T| i.e. for P4 and P3 © Ffi. 

It is well known that P4-free graphs are perfect (they are exactly the so 
called co-graphs), and hence their chromatic number can be determined in linear 
time [3- 

For graphs which do not contain P3 © K\ we use the following structural 
result of Olariu: 

Proposition 1. ([7]) If G is (P 3 © Ffi)-/ree then every connected component of 
its complement G is triangle-free or is a complete multipartite graph. 
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Ki 2Ki 3Ki P 2 
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^2 0^4^1 P 3 P3©-?^1 P 4 

Fig. 1. The graphs H for which //-Free Coloring is polynomially solvable 



The chromatic number of a graph equals the clique covering number of its 
complement. In this way we determine the chromatic number of a (P 3 0 Ki)- 
free graph in polynomial time. The clique covering number of each triangle-free 
connected component of the complement equals the size of the component minus 
the maximum size of a matching, and thus can be determined in polynomial 
time (e.g., by Edmonds’s algorithm). The clique covering number of complete 
multipartite graphs can also be computed in polynomial time (it equals the size 
of the largest part of the multipartition), and so it follows from Proposition[T]that 
the chromatic number of (P3 © iCi)-free graphs can be computed in polynomial 
time. This concludes the ’’easy” part of the theorem. 

3 Clique Covering of Sparse Graphs 

In this section we present an NP-completeness reduction for coloring graphs 
without forests which consist of short paths. It is more convenient to state 
the result for the complementary problem of clique covering. We denote by 
the graph on 4 vertices with 5 edges. 

Theorem 2. The clique covering problem remains NP-complete even when re- 
stricted to planar {K^, ,Ci^C 5 }-free graphs. 

Proof. We use a reduction from the Satisfiability problem restricted to planar 
formulas, namely a variant in which every clause of the input formula contains 
2 or 3 literals and every variable occurs in exactly 3 clauses, once positive and 
twice negated. A formula is called planar if its bipartite incidence graph is planar. 
NP-completeness of planar formulas was first established by Lichtenstein [^, 
and NP-completeness of the above described restricted version can be found 
e.g. in [ 3 j. 

Given a formula with variable set X and clause set C, we construct a graph 
G as follows. Fix a planar noncrossing drawing of the incidence graph of <T. For 
every variable x € X, the variable gadget is the graph 

Gx = {{x, x^ ,xf ,xf}, {xx~^ , xxf ,xxf ,xf xf}). 
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For every clause c, the clause gadget Gc will be a cycle of length 7. Three 
nonconsecutive vertices of the cycle are used as ports for connecting with vari- 
able gadgets. If X, y, z are variables occurring in c such that in the fixed draw- 
ing, the edges ex, cy, cz leave the vertex c in this clockwise order, we denote 
c(x),c(y),c(z) the ports on Gc also in this clockwise order. (If c contains only 
two literals, we use as ports two vertices of Gc which are at distance two on 
the cycle.) For every variable x, let c be the clause containing x positive and 
di,d 2 the clauses that contain -ix. Then identify vertices c(x) and x~^ into one 
vertex, vertices di(x) and into one vertex and vertices d 2 (x) and into one 
vertex. 



c(v) 




Fig. 2. The construction of variable and clause gadgets 



It is clear that the graph G constructed in this way is planar. The only tri- 
angles occur in variable gadgets, and hence no two triangles of G share an edge. 
Thus G contains neither K 4 nor . Similarly, G does not contain G 4 as an in- 
duced subgraph. In fact, since we may assume that no two clauses share two 
variables, the shortest cycles of G apart of the triangles in variable gadgets are 
the 7-cycles in the clause gadgets. 

We claim that the clique covering number of G is always > |X| -|- 3|C|, with 
equality holding if and only if is satisfiable. 

For every variable x, we need a clique Qx that would contain vertex x € 
V(Gx)- This clique may further contain either X+ (Qx is the edge xx+), or 
x(" and X 2 (if Qx is the triangle xx("x^). Obviously, X+ € Qx implies that Qx 
contains neither x(” nor x^. This establishes a 1-1 correspondence between truth 
assignments of X and the ways in which the vertex x is covered: 

(j){x) = True iff Qx = {xx’*’}. 

Note that a variable x evaluates True in a clause c if and only if the port c(x) 
in Gc is covered by Qx- 

For a clause c, we need at least 3 cliques to cover the vertices of Gc which 
are not ports. Hence a{G) > |X| -|- 3ICI. Moreover, if none of the ports in Gc 
is covered by the appropriate Qx, we need at least 4 cliques to cover this Gc- 
Therefore cr(G) > |X| -|- 3|G| if <P is not satisfiable. 

To see the converse, note that Cy without a vertex is the path Pq, which can 
be covered by 3 edges. Hence if at least one port in Gc is covered by the variable 
clique Qx (i.e., x evaulates True in c), the remaining vertices of such Gc can be 
covered by 3 cliques. This shows that <t{G) = |X| -|- 3|G| if is satisfiable. 
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Corollary 1. The { 2 K' 2 , © 2KTi, 4K'i, Csj-FREE Coloring problem is NP- 
complete. In particular, the H-Free Coloring problem is NP-complete if H 
is a (not necessarily induced) spanning subgraph of 2 X 2 - 

II II II 

2K2 K2 © 2 Ki 4 Ki 



Fig. 3. The minimal linear forests H for which //-Free Coloring is NP-complete 



4 NP— Complete Cases 

In this section we summarize the NP-completeness part of the proof of Theo- 
rem [T] We consider three cases. 

Case 1 Suppose H contains a cycle, say Ck- Let G be an instance of vertex 
3-colorability, which is well known to be NP-complete. 

Let H' be a critical non-3-colorable graph of girth greater than k (i.e., H' 
does not contain cycles of length at most k; existence of graphs of large girth 
and chromatic number is a well known result in graph theory, cf. e.g. W- Take 
an edge ab of H' , delete it and add a new extra vertex a' pending on b. Call 
this new graph H'{a,a'). Since the girth of H' was greater than fc, H'{a,a') 
does not contain Ck and also every path from a to a' has length greater than k. 
Since H' was critical non-3-colorable, H'{a, a') is 3-colorable, but in every proper 
3-coloring, a and b have the same color, i.e., a and a' have different colors. 

Return to G and construct a graph G' by replacing every edge uv by a copy 
of H'{u,v). Thus G' has girth greater than k and it is 3-colorable if and only if 
G is 3-colorable. Hence 3-coloring //-free graphs is NP-complete. 

Case 2 Let H he a forest, and suppose H contains a vertex of degree at least 
3. Then H contains /fi ,3 (the claw). Since 3-coloring is NP-complete for line 
graphs by Holyer [4] and since line graphs are claw-free, /fi 3 -FREE Coloring 
is NP-complete, and so is //-Free Coloring in this case. 

Case 3 Suppose finally that // is a linear forest, i.e., the disjoint union of paths. 

If H has at least 4 connected components then H contains 4/Li and //-Free 
Coloring is NP-complete by Corollary [H 

If H has exactly 3 connected components then either H = 3/Li (in which 
case // < P 3 © Ki and //-Free Coloring is polynomial by Proposition [ID or 
K 2 © 2Ki < H (and //-Free Coloring is NP-complete by Corollary [TJ . 

Suppose H has exactly 2 connected components. If none of them is an isolated 
vertex then 2 K 2 < H. If one of the components is a path of length at least 3 
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then K2 © 2 Ki < H. In these two cases iJ-FREE COLORING is NP-complete 
by Corollary[Tl If, on the other hand, H < P^(B Ki then iL-FREE Coloring is 
polynomial by Proposition [I] 

If H is connected then either H < P4 and iL-FREE Coloring is polynomial 
since every H-iiee graph is a co-graph, or P5 < iL and P-Free Coloring is 
NP-complete by Corollary [T] since 2K2 < H. 

5 Towards Two Forbidden Subgraphs 

A natural generalization of the question of coloring iL-free graphs is as follows. 

Meta-Problem Given a finite set of graphs A, what is the computational 
complexity of deciding the chromatic number of A-free graphs? 

We have completely classified the case of one-element sets A, and the nearest 
goal may be the case of two-element sets, i.e., coloring graphs that do not contain 
two specified induced subgraphs. Here the situation seems more complex and 
we have only obtained partial results. Our proof of Theorem [T] was based on 
two different NP-completeness constructions (our construction in the proof of 
Theorem[2]and Holyer’s reduction) which so far resist attempts to be combined. 

We classify graphs H into four types (Type D is disjoint from Types A,B 
and C, but the latter three are not mutually disjoint): 

Type A graphs containing cycles 

Type B graphs containing a claw (induced copy of Ki^s) 

Type C graphs containing an induced copy of a spanning subgraph of 2K2 
(these are depicted in Fig. E) 

Type D graphs which are induced subgraphs of P4 or of P3 © ATi . 

The following observation says that for determining the complexity of the 
{iLi, iL2}-FREE Coloring problem, we only need to care about cases when 
iLi and H2 are of different types among A, B and C. The three cases are then 
discussed in the subsequent subsections. 

Proposition 2 . Let A he a finite set of graphs. 

(i) If at least one of the graphs of A is of Type D then A-FREE COLORING 
is polynomially solvable. 

(ii) If all graphs of A are of the same type A or B or C, then A-Free 
Coloring is NP-eomplete. 

Proof. If one of the graphs H G A determines a polynomially solvable instance, 
then so does the set A, since every A-free graph is also iL-free. This implies (i) 
by Theorem [T| 

If all graphs of A are of type B (or C), then every claw-free ({2^2, ^2 © 
2ATi, 4ATi}-free) graph is A-free and Holyer’s reduction (or our Proposition [T]) 
apply. 

If all graphs of A are of type A, we perform the reduction described in Case 
1 of Section S] with k being the length of a longest cycle among all graphs of A. 
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5.1 Types B versus C (Forests versus Forests) 

Say Hi is of Type B and H2 is of Type C. Let us assume further that none of 
Hi,H2 is of Type A (the open problems of these cases will be stated below). 
Every forest which contains a claw and at least one more vertex contains a span- 
ning subgraph of 2K2 , regardless of how and if the other vertex is connected to 
the claw. So if Hi ^ ATi 3 then both Hi and H2 are of Type C and the {ili, H2]- 
Free Coloring problem is NP-complete. Thus the only open question remains 
when Hi = iLi.3: 

Problem 1 . Complexity of {H, ATi 3 }-Free Coloring in case H isa linear forest 
which contains a spanning subgraph of 2K2 as an induced subgraph. 



5.2 Types A versus B (Cycles versus Claws) 

Suppose Cfc < Hi and Ki^^ < H2- We first show that all cases with fc > 4 
determine difficult instances: 

Proposition 3. The problem {C^, 3 }-Free Coloring is NP-complete for 
every k> A. Hence it is NP-complete for every pair Hi, H2 such that Ck < Hi, 
k> A, and Ai 3 < H2- 





Fig. 4. Chain of A 4 "’s. 



Proof. Holyer proved that 3-colorability of line graphs of cubic triangle-free 
graphs is NP-complete. Hence 3-colorability is NP-complete for 4-regular claw- 
free graphs such that any two triangles share at most one vertex. Take such 
a graph G, split each vertex into two (in such a way that both triangles containing 
the vertex survive) and connect these two vertices by a sufficiently long chain of 
copies of Kf as indicated in Fig. [H The new graph G' does not contain short 
cycles (apart of triangles) nor it contains claws. The chain of AT^’s enforces that 
in any 3-coloring the split vertices receive the same color, and so the new graph 
is 3-colorable if and only if the original one was. 
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So it remains to consider the case when H\ contains triangles but no cycle of 
greater length. Take again Holyer’s reduction. Line graphs of cubic triangle-free 
graphs are claw-free and any two triangles share at most one vertex. Hence we 
can conclude: 

Observation 3. The problem ,Ki^^-¥ke¥. Coloring is NP-complete. 

This leaves the following open case: 

Problem 2 . Complexity of {iLi, iJ 2 }-FREE Coloring for Hi being a claw-free 
graph containing triangles but no longer cycles, such that every two cycles share 
at most one vertex, and H2 being a forest of maximum degree at least 3 . 

Note that graphs Hi from Problem H have a tree-like structure. If H2 has 
more than 4 vertices it contains a spanning subgraph of 2K2, and hence the 
problem is NP-complete whenever Hi is of Type B as well. So the interesting 
cases are if Hi is a triangle, or a triangle plus an isolated vertex, or a triangle 
with one or two pendant edges. However, if H2 = K13 then the number of open 
cases is larger. Though many of them are shown NP-complete by the reduction 
in the proof of Proposition El many still remain open. A detailed description of 
the NP-complete and polynomial-time solvable cases known to us will appear 
in the full version of the paper. Note also that only cases with forests H2 of 
maximum degree at most 4 remain open, because of Observation El below. 

It may be surprising that excluding either C3 or together with any other 
graph is essentially equivalent: 

Observation 4. The problems {C 3 , iL}-FREE Coloring and {C^,iL}-FREE 
Coloring are polynomially equivalent. 

Proof. Every C;^-free connected graph is a Ca-free graph or a complete mul- 
tipartite graph due to Proposition [H Both complete multipartite graphs and 
Cs-free graphs can be easily recognized in polynomial time. Thus if the problem 
{C 3 , 7 L}-Free Coloring can be solved in polynomial time for a fixed k, we 
split a graph given as an instance of the problem {C^,iL}-FREE Coloring to 
its connected components, and we apply the algorithm of the problem {C^jH}- 
Free Coloring to the components which are Ca-free. The chromatic number 
of its other connected components can be determined easily in polynomial time, 
since each of them forms a complete multipartite graph. The chromatic number 
of the original graph is clearly the maximum chromatic number of its connected 
components. Thus the problem {C;^, iJ}-FREE Coloring can be solved in poly- 
nomial time in this case, too. 

On the other hand, C3 is an induced subgraph of and hence it holds 
that {C 3 , H}-Free Coloring oc iJ}-FREE Coloring, since any instance 
of the problem {C^,iL}-FREE Coloring is also an instance of the problem 
{C3,7L}-Free Coloring. 

We believe that at least the particular case of Hi = C3 in Problem[ 2 l deserves 
separate interest. In other words, we investigate triangle-free graphs. The case 
of a triangle against a star is fully characterized: 



262 



Daniel Krai’ et al. 



Observations. The problems {C3, K'l^fej-FREE Coloring and 
Free Coloring are polynomially solvable for fc < 4 and NP-complete other- 
wise. 

Proof. It is enough to restrict to {C3, i4Ti_fc}-FREE Coloring due to Obser- 
vation 31 Every triangle-free i^Ti 4-free graph has maximum degree 3 and by 
Brooks’s theorem, it is 3-colorable, unless it has K4 as a connected component, 
an easily distinguishable instance. The NP-completeness of {C3, iFi_5}-FREE 
Coloring is proved in [^. 

5.3 Types A versus C (Cycles versus Linear Forests) 

In the last case we assume that Ck < H\ and H2 contains a spanning subgraph 
of 2K2 as induced subgraph. This case is solved if cycles of length greater than 

4 appear in Hi. 

Proposition 4. The problem {Ck, i/2}-FREE Coloring is NP-complete ifk > 

5 and H 2 contains a spanning subgraph of 2 K 2 as an induced subgraph. 

Proof. If k > 6 then 2 K 2 < Ck and the NP-completeness follows from the fact 
that both Hi and H2 are of the same Type C. For fc = 5, NP-completeness 
follows from Corollary [H 

The last open case, however, includes a large number of subproblems and 
its complexity is not well understood. We note that many cases are polynomi- 
ally solvable because of structural reasons (including Ramsey’s Theorem), e.g., 
{A4, 4 Ai}-Free Coloring allows only finite number of input graphs. 

Problem 3. Complexity of {i?i, iJ2}-FREE Coloring if Hi contains only in- 
duced cycles of length 3 and/or 4, and H2 contains a spanning subgraph of 2K2 
as induced subgraph. 
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Abstract. We answer a question of Brandstadt et al. by showing that 
deciding whether a line graph with maximum degree 5 has a stable cutset 
is VP-complete. Conversely, the existence of a stable cutset in a line 
graph with maximum degree at most 4 can be decided efficiently. The 
proof of our VP-completeness result is based on a refinement on a result 
due to Chvatal that recognizing decomposable graphs with maximum 
degree 4 is an VP-complete problem. Here, a graph is decomposable if 
its vertices can be colored red and blue in such a way that each color 
appears on at least one vertex but each vertex v has at most one neighbor 
having a different color from v. We also discuss some open problems on 
stable cutsets. 



1 Introduction 

In a graph, a stable set (a clique) is a set of pairwise non-adjacent (adjacent) 
vertices. A cutset (or separator) of a graph G is a set S of vertices such that 
G — S' is disconnected. A stable cutset (a clique cutset) is a cutset which is also 
a stable set (a clique). 

Clique cutsets are a well-studied kind of separators in the literature, and have 
been used in divide-and-conquer algorithms for various graph problems, such as 
graph coloring and finding maximum stable sets; see m . Applications of clique 
cutsets in algorithm designing based on the fact that clique cutsets in arbitrary 
graphs can be found in polynomial time ([m). 

The importance of stable cutsets has been demonstrated first in [na in 
connection to perfect graphs. Tucker proved that if S' is a stable cutset 
in G and if no induced cycle of odd length > 5 in G has a vertex in S then 
the coloring problem on G can be reduced to the same problem on the smaller 
subgraphs induced by S and the components of G — S. Unfortunately, deciding 
whether a graph has a stable cutset is VP-complete (let Stable Cutset denote 
this problem). This is an easy consequence of a result due to Chvatal |3] on 
decomposable graphs; see also pp. In fact, Chvatal’s results from [3] implies the 
following 

Theorem 1. Stable Cutset is N P -complete, even if the input graph is re- 
stricted to line graphs with maximum degree 6. 
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In [T], Brandstadt et al. then asked whether the degree constraint in Theo- 
rem [T] is best possible. In this paper, we consider this question and prove that 
Stable Cutset is iVP-complete, even for 5-regular line graphs of bipartite 
graphs, and is efficiently solvable for line graphs with maximum degree 4. Also, 
some open problems will be discussed. 



2 Decomposable Graphs Revisited 

A graph is decomposable if its vertices can be colored red and blue in such a 
way that each color appears on at least one vertex but each vertex v has at 
most one neighbor having a different color from v. In other words, a graph is 
decomposable if its vertices can be partioned into two nonempty parts such that 
the edges connecting vertices of different parts form an induced matching. 

Chvatal proved that recognizing decomposable graphs is A^P-complete, 
even for graphs with maximum degree 4. Based on Chvatal’s original proof we 
extend this result . Thereby, we also extend a result of Moshi that recognizing 
decomposable graphs is A^P-complete, even if the input is restricted to bipartite 
graphs of minimum degree 2. 

Theorem 2. Recognizing decomposable graphs is N P -complete, even if the in- 
put is restricted to bipartite graphs with one color class consisting only of vertices 
of degree 3 and the other color class consisting only of vertices of degree 4. 

Proof. A hypergraph is called bicolorable if its vertices can be colored red and 
blue in such a way that no edge is monochromatic. Lovasz jS] proved that rec- 
ognizing bicolorable hypergraphs is an iVP-complete problem, even if the input 
is restricted to hypergraphs with each edge having size 3. Given such a hyper- 
graph p[ we shall construct a graph G such that G is decomposable if and only 
if H is bicolorable. Now observe that the complete bipartite graph A' 2,3 is inde- 
composable. For each vertex v of H, belonging to d edges, take a chain Ay of 
Ad — 1 different complete bipartite graphs AT 2 , 3 ) say D\, ..., D 44-1 each with the 
partite set Gj = {aj, bj,Cj} of degree two vertices for j = 1, ..., Ad — 1, such that 
we identify cj with Gj+i for j = 1, ...,Ad — 2. (Ay is illustrated in Figure 1 for 
d= 1). 




Fig. 1. 



Label one of the vertices of degree 2 in this Ay by v* and the remaining Ad 
vertices of degree two by the distinct labels (v,e,i)Q) with i = 0, 1, j = 0, 1 
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and e running through all the edges with v G e. For each t = 0, 1 take a similar 
chain of 2n + m — 2 complete bipartite graphs ^^ 2,3 with n and m standing 
for the number of vertices and edges in H, respectively. Label n of the vertices 
of degree two in Bi by (v, i)i and n of the vertices of degree two in Bi by (v, 1)2 
with V running through all the vertices of H and label the remaining m vertices 
of degree two in Bi by (e, i) with e running through all the edges of H . For each 
t = 0, 1 and for each edge e = {x, y, z} of H add the graph shown in Figure 2 in 
which the vertices of degree two are labeled. 




Furthermore, for each vertex v of H, add the graph shown in Figure 3 in 
which the vertices of degree two are labeled. 

Note that in each graph described above, vertices of degree 2 are exactly 
the labeled vertices, and that each graph is bipartite with one color class of 
the bipartition consisting only of vertices of degree 3 and the other color class 
consisting only of vertices of degree 2 or 4. 

Now, identify vertices with the same labels. The resulting graph is the desired 
G, a bipartite graph with minimum degree 3, maximum degree 4, all vertices of 
maximum degree 4 form a stable set and likewise all vertices of minimum degree 
3 form another stable set. 

Now consider a decomposition of G. As each complete bipartite graph AT 2,3 
is indecomposable, the n graphs Ay and the two graphs Bi are indecomposable: 
in each of them, all the vertices have the same color. Without loss of generality, 
say that all the vertices of Bg are red. Now all the vertices in B\ must be blue: 
otherwise all Ay would be red [since the (red colored) graph in Figure 3 would 
force red on u*] and then it is not difficult to observe that all vertices of the 
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graphs of Figure 2 type would receive the color red as well and therefore the 
entire graph would be red, a contradiction. 

Next, consider an arbitrary edge e = {x,y,z} of H. At least one of the 
graphs Ax, Ay or A^ must be red [otherwise the graph in Figure 2 would force 
blue on (e, 0)] and at least one of them must be blue [otherwise the graph in 
Figure 2 would force red on (e, 1)]. Hence, the colors on the n graphs Ay define 
a bicoloring of H. Conversely, each bicoloring of H may be used to color the n 
graphs Ay and then extended into a decomposition of G. □ 



V* 




Remark 1. It should be remarked that we eould also use graphs of figure 1 of 
’arbitrary’ length. Henee, for every given S > 0 we ean choose one color class 
containing all vertices of degree 3 and the other color class containing vertices 
of degree 2 and 4 s. t. the average degree is at most (1 + <5)3. 

As Chvatal has shown, the A^P-result is best possible with respect to degree 
constraints: 

Theorem 3 (Chvatal [3j). Decomposable graphs of maximum degree 3 can be 
recognized in polynomial time. 

We will make use of this theorem in proving that Stable Cutset can be 
solved in polynomial time for line graphs of maximum degree at most 4. 
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3 Stable Cutsets in Line Graphs 

As suggested by Cunnigham (see [I]) and shown by Brandstadt et al. the stable 
CUTSET problem for line graphs can be derived by the result of Chvatal on 
decomposable graphs. Recall that the line graph L{G) of a graph G has the 
edges of G as its vertices, and two distinct edges of G are adjacent in L{G) if 
they are incident in G. The relationship between decomposability and a stable 
cutset is 

Proposition 1 ([!]). If L{G) has a stable cutset, then G is decomposable. If 
G is decomposable and has minimum degree at least 2, then L{G) has a stable 
cutset. 

Now observe that if G is a bipartite graph with one color class consisting 
only of vertices of degree 3 and the other color class consisting only of vertices 
of degree 4, then L{G) is a 5-regular graph. With this last proposition, the 
extension of the complexity result of Chvatal as proven in the last section and 
the latter observation we derive the following strengthening of the TVP-complete 
result given in Theorem (T) 

Theorem 4. Stable Cutset is N P-complete, even if the input is restricted 
to 5-regular line graphs of bipartite graphs. 

By Remark [I] at the end of the last section we also obtain the following 
related result. 

Theorem 5. Stable Cutset is N P-complete, even if the input is restricted to 
line graphs of order n and size (2 -|- e)n of bipartite graphs with minimum degree 
at least 2, maximum degree 4, all vertices of degree 3 form an independent set 
and likewise all vertices of even degree form an independent set. 

These results are best possible, since we will demonstrate in the following 
that Stable Cutset is polynomial if the input is restricted to line graphs with 
maximum degree at most 4. 

4 Observations 

As a corollary of Theorem |3| and Observation |3] below. Stable Cutset can be 
solved in polynomial time on line graphs of maximum degree 3. We will see in 
the next sections that Stable Cutset can be solved in polynomial time for 
arbitrary graphs of maximum degree 3 and for line graphs of maximum degree 
4. For this purpose, we need some observations. 

Observation 1. Let S be a stable cutset of G and let v € V{G). If N{v) is not 
a stable set, then G — v has a stable cutset. 



Proof UvGS,S — visa, stable cutset in G — If u belongs to component A 
of G — S', then, as N{v) is not a stable set, |A| > 2, hence S is a stable cutset in 
G-v. □ 
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Observation 2. Let v G V{G), and let S be a stable cutset of G — v. If S (1 
N (v) =0 or N(v) — S belongs to the same component in {G — v) — S, then G 
has a stable cutset. 

Proof. In the first case, S U {t>} is a stable cutset in G, 
a stable cutset in G. 

A vertex z; in G is simplicial if Ng{v) is a clique, 
directly follows from the previous two. 

Observation 3. Assume that G has at least three vertices, and let v be a sim- 
plicial vertex of G. Then G has a stable cutset if and only if G — v has a stable 
cutset. □ 

The following observation admits an argument that will be used in further 
discussions. 

Observation 4. Let v be a vertex of a graph G such that G[7V(z;)] consists of a 
clique G and a vertex a with N{a) fl G C {c} for a vertex c € C. If G — v has a 
stable cutset S, then G has a stable cutset, or we may assume that 

— S' n N{v) = {c}, 

— there are two different components A,B of {G — v) — S with a € A and 
G — {c} C B, and 

— {G-v)-S = ALIB. 

Proof. If N{a)r\G = {c} but SflG {c} then, by Observation!^ G has a stable 
cutset. If N{a) fl G = 0 but S fl G = 0 then, by Observation [2] again, G has a 
stable cutset. Thus, in any case, we may assume that S fl N{v) = {c}. 

If N (v) — {c} belongs to the same component in (G— 1 >) — S then, by Observa- 
tion!^ G has a stable cutset. Thus we may assume that a belongs to component 
A, G — {c} belongs to another component B of {G — v) — S. 

If {G — v) — S has a third component then clearly S is a stable cutset in 
G. □ 

5 Stable Cutsets in Graphs with Maximum Degree 3 

We show in this section that Stable Cutset can be solved in polynomial time 
on (not necessarily line) graphs with maximum degree at most 3. We discuss the 
non-trivial case where the input graph G has maximum degree 3 and at least 7 
vertices. Furthermore, we may assume that every vertex of G has a non-stable 
neighborhood (otherwise we are done), and that G has no simplicial vertex (by 
Observation 0). 

Consider a vertex v of degree 3, i. e. a vertex attaining the maximum degree 
3. We will show that G has a stable cutset if and only if G — has a stable 
cutset. By Observation [T] we need only to show the if part. Thus, let S be an 
inclusion- minimal stable cutset in G— We will show that G has a stable cutset. 

As N{v) is not a stable set and not a clique, it consists of three vertices a, b, c 
such that b and c are adjacent and a is non-adjacent to b. By Observational we 



in the second case, S is 
□ 

The observation below 
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may assume that S fl N{v) = {c}, and {G — v) — S has exactly two components. 
Let A, B be the components oi {G — v) — S with a & A and b G B. 

Case 1: a is adjacent to c. 

If A ^ {a} then the neighbor of a in A — {a} and the vertex b form a stable 
cutset in G disconnecting a and the neighbor of b different from v and c (such 
a neighbor of b exists because b is not simplicial). Similarly, if B ^ {6}, G has a 
stable cutset. Thus we may assume that A = {a} and B = {6}. Then, as G has 
at least 5 vertices, l^l > 2, hence {a,b} is a stable cutset in G disconnecting v 
and S — {c}. Case 1 is settled. 

Case 2; a is non-adjacent to c. 

By the minimality of S, c has (exactly one) neighbor c' G A. If a has no neighbor 
in S', 50 {a} is a stable cutset in G disconnecting c' and v. So, let us assume that 
a has (exactly one) neighbor a' G S. By the minimality of S, a' has a neighbor 
a" e B. 

If a” ^ b, then b has exactly one neighbor b' G B — {6}, and {a,c,b'} is a 
stable cutset in G disconnecting b and c' . So, we may assume that a" = b, hence 
B = {b}. Now, if a' and c' are non-adjacent, then {a',c',u} is a stable cutset 
in G disconnecting a and b, and if a and c' are non-adjacent, then {a, 6, c'} is a 
stable cutset in G disconnecting v and o'. The case a'c' and ac' are edges of G 
cannot occur because G has at least 7 vertices. Case 2 is settled. 

Thus, for every vertex v of maximum degree 3, G has a stable cutset if and 
only if G — V has a stable cutset. Repeat the same reduction on G' = G — z; if G' 
still has maximum degree 3 and at least 7 vertices. In this way, we can determine 
in linear time whether G has a stable cutset. Therefore we obtain 

Theorem 6. Stable Cutset can be solved in linear time on graphs with max- 
imum degree at most 3. □ 

6 Stable Cutsets in Line Graphs of Maximum Degree 4 

Let L = L{G) with maximum degree A = A{L) = 4 and at least 13 vertices. 
We shall reduce L to a line graph L* such that L* = L{G*), and 

(i) L has a stable cutset if and only if L* has a stable cut set, and 

(ii) G* has maximum degree at most 3 and minimum degree 2. 

Thus, by Proposition [I] and Theorem we obtain 

Theorem 7. Stable Cutset ean be solved in polynomial time on line graphs 
of maximum degree at most 4. □ 

The reduction 

First, by Observation [31 we may assume that L has no simplicial vertex. Let v 
be a vertex of L of maximum degree d(y) = 4, and let xy be the corresponding 
edge in G. We call v bad if (exactly) one of x,y is of degree 4 in G (cc or y 
cannot have degree 1 in G otherwise v would be a simplicial vertex in L) . If no 
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vertex of degree 4 in L is bad, then G has maximum degree 3 and we are done 
by Theorem 

Claim; If L has a bad vertex v of degree 4 then L has a stable cutset if and 
only if L — V has a stable cutset. 

Proof: See the full version of the paper. 

Note that L' := L — v is again a line graph. Repeat the same reduction for L' 
instead of L if L' still has a bad vertex of degree 4 and at least 13 vertices. In 
this way we obtain the desired line graph L* . 

The root graph G* with L* = L{G*) can be determined in linear time (see 

iw np- 

7 Concluding Remarks 

We have shown that Stable Cutset is NP-complete for 5-regular line graphs of 
bipartite graphs, and that Stable Cutset can be solved in polynomial time for 
line graphs of maximum degree at most 4 and for arbitrary graphs of maximum 
degree at most 3. Thus, it would be interesting to fill the gap in the complexity 
status of Stable Cutset on arbitrary graphs of maximum degree 4. 

Problem 1. Is Stable Cutset in P or NP-complete on graphs with maxi- 
mum degree 4? 

We are going to explain a connection between this problem and another 
interesting result on stable cutsets. The following nice theorem was conjectured 
by Caro and has been proved by Chen and Yu 0. 

Theorem 8 ([2|). Every graph with n vertices and at most 2n — 4 edges has a 
stable cutset. □ 

(This theorem implies that every graph with at least 8 vertices and maximum 
degree at most 3 has a stable cutset, therefore Stable Cutset is in P for graphs 
with maximum degree at most 3; cf. also our discussion in Section 5 without use 
of Theorem [^) 

Now, let G be a graph with n vertices and maximum degree 4. Then G has 
at most 2n edges. By Theorem [HI Problem [1] remains open only in four cases, 
namely for graphs with n vertices and m edges where 2n — 3 < m < 2n. 

Thus, the following problem is of interest. 

Problem 2 (SCS(n,m)). Given a graph G with n vertices and at most m 
edges. Does G have a stable cutset? 

Theorem IH] shows that SCS(n, 2n — 4) is in P, while our Theorem |5]implies 
that, even on line graphs, SCS(n, (2-|-e)n) is fVP-complete for every fixed e > 0. 
A question arises: What is the maximum number m (depending on n) for which 
SCS(n, m) can be solved efficiently? In particular, is SCS(n, 2n — 3) in PI 
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Abstract. Motivated by parallel routing in networks with faults, we 
study the following graph theoretical problem. Let G be a d-regular 
graph. We say that G is strongly Menger- connected if for any copy G/ 
of G with at most d — 2 nodes removed, every pair of nodes u and v in 
G/ are connected by vam{degf{u),degf(v)} node-disjoint paths in G/, 
where degf{u) and degf{v) are the degrees of the nodes u and v in G/, 
respectively. We show that the star graphs, which are a recently proposed 
attractive alternative to the widely used hypercubes as network models, 
are strongly Menger-connected. An algorithm of optimal running time is 
developed that constructs the maximum number of node-disjoint paths 
of nearly optimal length in star graphs with faults. 



1 Introduction 

Parallel routing (i.e., construction of node-disjoint paths) has been an important 
issue in the study of computer networks. Routing by node-disjoint paths between 
nodes can not only avoid communication bottlenecks, thus increase the efficiency 
of message transmission, but also provide alternative paths in case of node fail- 
ures. By the famous Monger’s Theorem m, if a network G is a d-connected 
graph, then every pair of nodes in G are connected by d node-disjoint paths. 

With the continuous increasing in network size, routing in networks with 
faults has become unavoidable. Suppose that the network G has a set Sf of faulty 
nodes. Let u and v be two non-faulty nodes in G. Based on local information, 
we know the numbers degf{u) and degf{v) of non-faulty neighbors of u and z;, 
respectively, and are interested in constructing the maximum number of node- 
disjoint fault-free paths between u and v. Obviously, the number of node-disjoint 
fault-free paths between u and v cannot be larger than miii{degf{u),degf{v)}. 
We are interested in the precise bound on the size of the faulty node set S / such 
that for any two non-faulty nodes u and v in G, there are min{deg/(zt), degf{v)} 
node-disjoint fault-free paths connecting u and v. Note that this problem has a 
strong similarity to that of Monger’s Theorem. 

Since most popular network topologies are regular graphs, we will concentrate 
on regular graphs. Suppose that the network G is a d-regular graph (i.e., all nodes 
of G have degree d), then in general the number of faulty nodes in the set Sf 
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should not exceed d — 2 to ensure min{degf{u),degf{v)} node-disjoint fault-free 
paths between any two nodes u and v. This can be seen as follows. Let u and 
V be two nodes in G whose distance is larger than 3. Pick any neighbor u' of u 
and let the other d — 1 neighbors of u' become faulty. Note that no neighbor of 
u' can be a neighbor of v since the distance from u to u is at least 4. Now each 
of the nodes u and v has d non-faulty neighbors. However, there are obviously 
no d node-disjoint fault-free paths in G from u to v since one of the d neighbors 
of u, the node u', leads to a “deadend” . This motivates the following definition: 

Definition 1. A d-regular graph G is strongly Menger-connected if for any copy 
Gf oi G with at most d — 2 nodes removed, each pair u and v of nodes in G/ are 
connected by mm{degf{u),degf{v)} node-disjoint paths in Gf, where degf{u) 
and degf{v) are the degrees of the nodes u and v in Gf, respectively. 

Because of its motivation, in case the graph is strongly Menger-connected, 
we are also interested in constructing the node-disjoint paths so that the path 
lengths are minimized. To authors’ knowledge, there has not been a system- 
atic study on this issue on popular network structures. Some related research 
includes Galil and Yu’s work on constructing short node-disjoint paths in a gen- 
eral graph m and Hsu’s work on graph containers m- However, neither of 
them considered graphs with faulty nodes. 

We will discuss the strong Menger-connectivity in detail on star graphs [2], 
which have received considerable attention recently as an attractive alternative 
to the widely used hypercubes as network models, because of their rich struc- 
ture, smaller diameter, lower degree, and symmetry properties prra] . We provide 
another advantage for star graphs in this paper by showing that star graphs are 
strongly Menger-connected and developing an algorithm of optimal running time 
that constructs the maximum number of node-disjoint paths of nearly optimal 
length between any two nodes in the n-star graph with at most n — 3 nodes 
removed (note the n-star graph is (n — l)-regular). 

We briefly mention the related research on star graphs. Constructing a single 
path between two nodes in star graphs with faults has been studied extensively 
| l|2 |3| 4 | ll | 18j . Constructing node-disjoint paths in star graphs without faulty 
nodes has also been studied | I6I7I8I9|12| . In particular, an efflcient algorithm has 
been developed that constructs the maximum number of node-disjoint paths 
of optimal length for any two given nodes in star graphs. A randomized algo- 
rithm, based on the Information Dispersal Algorithm (IDA) [16], for constructing 
node-disjoint paths in star graphs with faults was proposed in m- 

2 Preliminaries 

A permutation u = (ai 02 • • • a„) of (12 • • • n) can be given by a product of disjoint 
cycles [5], which is called the cycle structure of u. A cycle is trivial if it contains 
only one symbol. The cycle containing the symbol 1 will be called the primary 
cycle. For example, the permutation (32541) has the cycle structure (351)(2)(4), 
where the primary cycle (351) indicates that 3 is at I’s position, 5 is at 3’s 
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position, and 1 is at 5’s position. The trivial cycles (2) and (4) indicate that 2 
and 4 are in their “correct” positions. 

We define a group of operations pi on permutations for 1 < z < n. For any 
permutation u, Pi(u) is the permutation obtained from u by exchanging the first 
symbol and the zth symbol in u. Let us consider how these operations change 
the cycle structure of a permutation. Write u in its cycle structure 

u = (an • • • l)(o 2 i • • • a 2 n^) ■ ■ ■ {aki ■ ■ ■ afe„J 

If the zth symbol a of it is not in the primary cycle, then Pi{u) “merges” the 
cycle containing a into the primary cycle. More precisely, let a = 021 (note each 
cycle can be cyclically permuted and the cycle order is irrelevant), then 



Pt(lt) — (tt21 * * ' a2n2^11 ’ ’ * 1)(^31 ‘ ‘ * ^3ri3 ) * * ‘ (^fcl ’ ’ ‘ ^kn^) 



If the ith symbol a is in the primary cycle, then pi{u) “splits” the primary cycle 
into two cycles. More precisely, suppose that a = a\j, where 1 < j < ni + 1 and 
we have let = 1, then 

Pi{u) = (an • • • aij_i)(oij • • • ai„^l){a2i ■ ■ ■ 02 ^ 2 ) ’ ' • («fci • ’ ' 

In particular, if a = 012 , then we say that the operation pi “deletes” the first 
symbol an from the primary cycle. 

The n-star graph Sn is an undirected graph consisting of n\ nodes labeled 
with the n! permutations on (12- --n). A node u in has n — 1 neighbors 
Pi(u), 2 < i < n. The n-star graph S'„ is vertex-symmetric [2]. Thus, a set of 
node-disjoint paths from a node ic to a node v can be mapped to a set of node- 
disjoint paths from a node it to e in a straightforward way, where e is the identity 
permutation (12 • • • n). Thus, we only need to concentrate on node-disjoint paths 
from u to e in 5'„. Denote by dist(u) the distance from it to e. 

It has been known |B] that the following are necessary and sufficient rules for 
tracing a shortest path from a node it to the node e in the n-star graph 5'„: 

Shortest Path Rules (briefly sp-rules) 

1. If It has a trivial primary cycle, then in the next node on a shortest 
path from it to e, a nontrivial cycle is merged into the primary cycle. 

2. If It has a nontrivial primary cycle (anai 2 ■ ■ • ai„^l), then in the next 
node on a shortest path from it to e, either another nontrivial cycle is 
merged into the primary cycle, or the first symbol an is deleted from 
the primary cycle. 

Remark 1. If an edge [it, v] in Sn does not follow the sp-rules, then dist{v) = 
dist{u) -I- 1 [SI - Thus, the length of a path from it to e in which exactly k edges 
do not follow the sp-rules is equal to dist{u) + 2k. 

For the n-star graph Sn, let 5'ri[i] be the set of nodes in which the symbol 1 
is at the ith position. It is well-known |T] that the set S'„[l] is an independent 
set, and the subgraph induced by the set 5„[i] for i 1 is a (n — l)-star graph. 
Note that a node is in the substar S'„[i], i yf 1, if and only if the primary cycle 
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of the node is of form (• • • il), and a node is in if and only if the primary 

cycle of the node is trivial. 

Two simple procedures will be used in tracing a shortest path from a 

node u to e. The first is the “Delete” procedure, written as — >■ — >■, which 

repeatedly deletes the first symbol in the nontrivial primary cycle. The second is 

the “Merge-Delete” procedure, written as — — >•, which works in two stages: 
first repeatedly merges in an arbitrary order each of the nontrivial cycles into the 
primary cycle, then repeatedly deletes the first symbol in the primary cycle. It is 
easy to verify that both procedures follow the sp-rules strictly. A nice property 
of the Delete and Merge-Delete procedures is that if they start with a node u 
in the substar i fy 1, then all nodes, possibly except the last one, on the 

constructed shortest path are also in the substar [i] . 

3 Bridging Paths from a Node to a Substar 

Our algorithm is based on the concept of bridging paths that connect a given 
node to a specific substar in the n-star graph. In this section, we give formal 
definitions for bridging paths, and study its properties. 

Lemma 1. Let i > 1 and 5'“[z] be the substar with ki < n — S nodes 
removed, Pi{s) G [fy node u in S~[i], a path P from u to Pi{e) in 

can be constructed in time 0{kiU + n) with at most two edges (in case the 
primary cycle of u is (*1), one edge) not following the sp-rules. 



Definition 2. Let u be a node in the n-star graph S'„ and u' be a neighbor 
of u in the substar f fy 1. For each neighbor v of u', u fy u, a (u',j)- 

bridging path P from u to j fy l,z, is defined as follows: if v is in S'„[l] 

then P = [u, u', v, Pj{v)], while if v is in then P = [n, u' , v, Pi{v), pj{pi{v))]. 

Thus, from each neighbor u' in of the node u, * fy 1, there are n — 2 
(fy, j)-bridging paths of length bounded by 4 that connect the node u to the 
substar Sn[j]- See Figure [T] for an intuitive illustration for bridging paths. 





Fig. 1. Bridging paths from node u to substar S'n[i]: (A) u is in Snll]; (B) u is in 5n[i] 
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Since no two nodes in share the same neighbor in S'„[l] and no two 
nodes in share the same neighbor in Sn[j], for any neighbor u' of u, two 

(it', j)-bridging paths from u to Sn[j] have only the nodes u and it' in common. 
Moreover, for any two neighbors it' and it" of it in (in this case, the node 
It must itself be also in since it' and it" have no other common neighbor 

except It (see, for example, |7l8p . a (it', j)-bridging path from it to Sn[j] and a 
(it", j)-bridging path from it to Sn[j] share no nodes except it. 

Definition 3. Let it in a node in S'„ and it' be a neighbor of it in i 1. A 
(it', j)-bridging path P from it to Sn[j] is divergent if in the subpath of P from 
It to <S'n[l], there are three edges not following the sp-rules. 

Note that the subpath from u to S'„[l] of a (it', j)-bridging path P contains 
at most three edges. In particular, if the subpath contains only two edges, then 
the path P is automatically non-diver gent. 

Lemma 2. A divergent (u' , j) -bridging path P from a node u to a substar Sn[j], 
where m' G i 1, j, can be extended into a path Q from u to Pj{s) in time 

0 {n), such that at most 4 edges in Q do not follow the sp-rules and the extended 
part is entirely in Sn[j]- Moreover, for two divergent {u' , j) -bridging paths, the 
two corresponding extended paths only share the nodes u, it', and Pj{e). 

Proof. Let P be a divergent (it', j)-bridging path from the node u to the substar 
Sn[j], where u' is a neighbor of it. Since P is divergent, it has length 4. Thus, the 
path can be written as P = {it, it', n, n', d"}, where it' is in i> is a neighbor 

of u' in Sn[i], v' = Pi{v) is in and v" = Pj{v') is in Sn[j]- 

Let It' = {ai ■ ■ ■ Gpil) * *, where “* *” stands for “other cycles”. Since the 
edge [it' , i>] does not follow the sp-rules and v is in Sn [*] , the node v must have 
the form either v = (bai ■ ■ ■ Opil) * *, where (6) is a trivial cycle in it', or i; = 
(fli • • • aq){aq+i ■ ■ ■ Upil) + *, where 2 < q < p. Now since [i>, v'] is an edge in Sn 
and v' is in S'n[l], the node v' must be of the form either v' = {bai ■ ■ ■ api){l) * *, 
or v' = (oq+i • • • api)(l) * *. Moreover, since the edge [ii, i^'] does not follow the 
sp-rules, when v' = (a^+i • • • api)(l)**, we must have q-|-l < p. In summary, if P 
is a divergent path, then the fourth node v' on P must be of form (&162 ’ • • *)(l)i 
where the cycle (61 &2 ■ • • i) is nontrivial. Moreover, the (it', j)-bridging path P is 
distinguished from other divergent (11', j)-bridging paths by the symbol b\ in the 
above format (i.e., two different divergent (it', j)-bridging paths will have two 
different symbols bi in the above format). 

Now consider the fourth edge [v' ,v"\ on P, where v" is in 5'ri[j], j y^ l,i. 

If j is in a trivial cycle in v' , then the extended path Q is 

It — >■ It' — >■ 1; — >■ ii' = (6162 • • • i)(l) * * — >■ ii" = (6162 • • • i)(jT) **—>■ 

^ (62 • • • ibijl) * * -)> {hijl) (jl) = pj{e) ( 1 ) 

The extended path Q has no common nodes in Sn[j], except Pj{s), with the paths 
extended from the other (it', j)-bridging paths since the symbol bi distinguishes 
the path Q from other extended paths: the first part of Q has a unique cycle 
(61 &2 • • • i) while the second part of Q has a cycle of the unique format (• • • bijl). 
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If j is in a nontrivial cycle in v' , then there are two possible cases: 

Case 1 . j is not in the cycle (6162 ■ ■ - i). Then the extended path Q is 

u — >■ rt' — >■ z) — >■ v' = (6162 • • • *)( 1 ) * * ^ v" = (6162 •••*)(•• • jT) **—>■•••—>■ 

^ (&162 • • • *)(jl) * * -)> (62 • • • ibijl) -)> {hijl) -)> (jl) = /Oj(e) (2) 

Again, because of 61, the extended path Q has no common nodes in Sn[j], except 
Pj{e), with the paths extended from the other (zt',j )-bridging paths. 

Case 2 . j is in the cycle (6162 ■ ■ - i). 

If j = bi, then (6162 • • • i) = (62 • • • ij), and the path Q is 

It — >■ u' — >■ u — >■ u' = (62 • • • u)(l) * * — >■ u" = (&2 ■ • • *jT) * * — >■ 

— ^ • • • — ^ (ul) — >■ (jl) = Pj{s) ( 3 ) 

This path is node-disjoint from the paths extended from the other (u',_j)-bridging 
paths because all nodes of it in ^nb’] contain a cycle of form ■ -ijl). 

If j yf &i, then (6162 ■ ■ ■ i) = {bi ■ ■ ■ j ■ ■ ■ i), and the path Q is 

u ^ u' ^ V ^ v' = {bi ■ ■ ■ j ■ ■ ■ i){l) * * ^ v" = {■ ■ ■ ibi ■ ■ ■ jl) **—>■ 

{■■■ibi-- jijl) **-)>(••• ibijl) * * -)> (6ijl) -)■ (jl) = pj{e) ( 4 ) 

Again this path is node-disjoint from the paths extended from the other (u' ,j)~ 
bridging paths because of the symbol b\. 

For all cases, we can easily verify that the path Q contains at most 4 edges not 
following the sp-rules, and that the part of Q extended from the (u', j)-bridging 
path P is entirely in the substar S'nb]- Finally, from the sequences (H])-(I 11 ) , it 
can be easily see that the construction of the path Q takes time 0 {n) and is 
independent of the construction of the other paths. □ 

4 The Strong Menger-Connectivity of Star Graphs 

We are ready to present our main algorithm. Let S~ be the u-star graph 
with at most n — 3 nodes removed such that e G S~ . Denote by degf{u) the 
degree of a node u in S~. The algorithm is presented in Figure [21 

Theorem 1 . The algorithm M- connectivity runs in time 0 {n^) and con- 
structs Tain.{degf{u),degf{e)'\ node-disjoint paths in S~ of length hounded by 
dist(u) -\- 8 from the node u to the node e. 

Proof. Step 1 of the algorithm constructs paths between some neighbors of the 
nodes u and e. Step 2 of the algorithm maximally pairs the rest neighbors of u 
and e, and Step 3 constructs a path from u to e for each of these pairs. Thus, 
the number of paths constructed by the algorithm M-connectivity is exactly 
min{degf{u),degf{e)}. What remain to discuss are the node-disjointness and 
the length of these paths, and the complexity of the algorithm. 
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First consider Step 1 of the algorithm. In case u is in for each index 

j ^ 1 with both pj (u) and pj (e) in S~ , by Lemma [T] we can construct in time 
0{kjn + n) a path Qj from Pj{u) to Pj{e) in S~ that is entirely in Sn[j] such that 
at most two edges in Qj do not follow the sp-rules, where kj < n— 3 is the number 
of nodes removed from Now the path Pj from u through Qj to e has at 

most three edges do not follow the sp-rules (the edge [pj (e) , e] always follows the 
sp-rules). By Remark 1, the length of Pj is bounded by dist{u) -|- 6. This path 
Pj is disjoint with other paths constructed in Step 1 because all internal nodes 
of Pj are in Sn[j]- 



M-connectivity 

Input: a node u in (the n-star graph Sn with at most n — 3 nodes removed) 
Output: min{degf{u),degf{e)} paths from u to e in . 

1. if the node u is in S„[l] 

1.1. then for each j ^ 1 with both pj{u) and Pj{e) in do 

construct a path Pj of length < dist(u) -f 6 from u to e such 
that all internal nodes of the path are in Sn\j]', 

1.2. else (* the node u is in a substar Sn[i], i 1 *) 

1.2.1. if the node pi{e) is in S'” 

then pick a neighbor u of u and construct a path of length 

< distiu) -\- 4 from u to e via v such that all internal nodes of 
are in Sn[*] and P„ does not intersect a («', j)-bridging 
path for any neighbor u' ^ v oi u and for any j yf 1, i; 

1.2.2. if the node ui = pi{u) is in S'" 

then find a j, j ^ l,i, with both Pj{ui) and pj{e) in S^; extend the 
path [u,ui, pj{ui)] to a path Pi of length < dist(u) -\- 8 from u 
to e such that all nodes between Pj{ui) and Pj{e) are in S„[j']; 

2. maximally pair the remaining neighbors of u and e; 

3. for each pair {ph{u), pj{e)) constructed in step 2 do 

3.1. if there is a non-divergent (p;i(u), j)-bridging path with no used nodes 
then pick this non-divergent (p/,(u), j)-bridging path P 

else pick a divergent (p;i(u), j)-bridging path P with no used nodes; 

3.2. extend the path P into a path Pj of length < dist(u) -|- 8 from u 
to e such that the extended part of Pj is entirely in Sn[j’]; 



Fig. 2. Node-disjoint paths in S„ with < n — 3 nodes removed 



In case m is in a substar f yf 1, Step I constructs at most two paths 

from u to e. If the node Pi(e) is in S~ , by Lemma [U we can construct in time 
0{kin + n) a, path Qy in S~ fl 5'„[z] from u to Pi{e) (where v is the second 
node on QQ such that at most two edges in do not follow the sp-rules, 
where < n — 3 is the number of nodes removed from Sn [i] . The path Qy plus 
the edge [pi{e),e\ gives a path Py of length bounded by dist{u) -|- 4 in which 
all internal nodes are in We show that the path Py can be constructed 

without intersecting a (it', j)-bridging path from it, for any neighbor it' yf i; of 
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u and any j ^ l,i. Suppose the contrast that intersects such )-bridging 
paths. Let w be the last node on Py that belongs to such a (u', j)-bridging path 
Qy! from u. Note that the neighbor u' of u is uniquely determined by the node w 
since for two different neighbors u' and u” of u in a («', j')-bridging path 

and a (u", /')-bridging path have no common nodes except u. Now, if we let Pu' 
be the subpath of Qu' from u to w plus the subpath of from w to e, then it 
is not hard to see that the length of the path P„/ is not larger than the length 
of the path Py, and that the path does not intersect any (u", j")-bridging 
path from u for any neighbor u" ^ u' ol u and for any j” ^ 1, L 

If the neighbor u\ = Pi{u) of u is in S~ , consider the n— 2 pairs (pj(iti), Pj{e)) 
of neighbors of u\ and e, where j yf l,i. Since at most n — 3 nodes are removed 
from Sn, one of these pairs (pj{ui), pj{e)) has both nodes in S~ . By Lemma Q] 
a path Qi from Pj{ui) to Pj{e) can be constructed in the substar Sn[j] in time 
0{kjn + n) such that at most two edges of Qi do not follow the sp-rules, where 
kj < n — 3 is the number of nodes removed from Now the concatenation 

of the path [u,u\, pj{ui)]^ the path Qi, and the edge [pj{e),e] gives a path in 
S~ from u to e of length bounded by dist{u) + 8 in S~ . Note that this path is 
obviously node-disjoint with the path constructed in Step 1.2.1. 

Now we consider Step 3 of the algorithm. 

Case 3 . 1 . The node u is in S'„[l]. 

For each pair {ph{u), pj{e)) formed in Step 2, the nodes Ph{s) and pj{u) must 
be removed: otherwise the index h or j would have been picked in Step 1.1. 

We construct a path Qhj from u to Pj{e) by concatenating a {ph(u),j)- 
bridging path from u to [j] with a path Q' entirely in the substar S'„ [j] . Note 
that such a path Qhj contains one node in .Sn[l], besides u, and all other nodes 
in Sn[h] and Sn[j]- We say that a node ui yf u in S'„[l] is used if ui has been used 
by a path constructed by the algorithm. Inductively, assume that for r pairs of 
neighbors of u and £ constructed in Step 2, the r node-disjoint paths have been 
constructed. We consider the {r + l)st pair {ph{u), Pj{e)). 

Each {ph'{u), pj'{e)) of the previous r pairs implies two removed nodes, the 
nodes Ph'{s) and Pj'(u), and one used node in S'„[l]. Also notice that the paths 
constructed in Step 1.1 do not use any nodes in S'„[l]. Thus, the number of 
removed nodes in the sets -Snll], Sn[h], and Sn[j] is at most (n — 3) — 2r = 
n — 2r — 3. Let kj be the number of removed nodes in Sn[j], kj < n — 2r — 3. 

case 3 . 1 . A. there is a non-divergent (p/j(m), j)-bridging path Phj for u with 
no used nodes. At least one of the first three edges on Phj follows the sp-rules. 
Consider the last edge [v' ,v"\ on Phj 

If the edge follows the sp-rules, then the path Phj has at most two 

edges not following the sp-rules. By Lemma [H we can construct a path Qt in 
the substar from v” to Pj{e) in time 0{kjTi + n) with at most two edges 

not following the sp-rules. Now the concatenation of the (p/i(u), j)-bridging path 
Phj^ the path Q', and the edge [pj{e),e\ gives a path Pj from u to e with at 
most 4 edges not following the sp-rules, whose length is bounded by dist{u) + 8. 

If the edge \v' ,v"] does not follow the sp-rules, then the path Phj may have 
three edges not following the sp-rules. Since v' is in 5'„[1], its primary cycle is 
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trivial. Now since v" is in and the edge \v' ^ v”] does not follow the sp-rules, 
the primary cycle of v" must be (jl). By Lemma[Tl a path Q' from v" to Pj{s) in 
Sn[j] can be constructed in time 0{kjn + n) with at most one edge not following 
the sp-rules. Now the concatenation of the (p/i(rt), j)-bridging path Phj, the path 
Qj, and the edge [pj{e),e] gives a path Pj from u to e with at most 4 edges not 
following the sp-rules, whose length is bounded by dist{u) + 8. 

Therefore, in case 3.1. A, for the pair {ph{u), pj{e)), we can always construct, 
in time 0{kjn + n), & path Pj with no used nodes and of length bounded by 
dist(u) + 8 from node u to node e. This path is node-disjoint with all previously 
constructed paths since the part extended from the (pft,(u), j)-bridging path Phj 
is entirely in the substar Sn[j] that is not used by any other paths. 

case 3.I.B. all non-divergent (p?i(u), j)-bridging paths contain used nodes. 

There are totally n — 2 (p;i(u), j)-bridging paths from u in S'„. Suppose that 
q' of them contain either removed nodes or used nodes, and that q = n — 2 — q' 
of them contain neither removed nodes nor used nodes. We show q > 0. 

Assume q — 0, so q' = n — 2. Since any two (p/j(m), j)-bridging paths from u 
have only the nodes u and Ph(u) in common and at most n — 3 nodes are removed 
from Sn, > 0 of the n — 2 (p;,(it), j)-bridging paths contain only used nodes. 
Each of the rest q '2 = q' — q[ = n — q[ — 2 (p?i(u), j)-bridging paths contains at 
least one removed node. Thus, at least q[ paths have been constructed by the 
algorithm for q[ pairs (note that each constructed path uses exactly one node in 
S'„[l]). Each (ph'{u), pj>{e)) of these pairs implies two removed nodes Pj'{u) and 
Ph'{s). Thus, the total number of removed nodes in the n-star graph Sn would 
have been at least q '2 + 2g( = n + q'l — 2 > n — 3, contradicting the assumption 
that at most n — 3 nodes are removed from Sn- This shows 9 > 0, i.e., there is 
at least one (ph(w), j)-bridging path in S~ containing no used nodes. 

By the assumption, the q (p?,(u), j)-bridging paths without used nodes in S~ 
are all divergent. By Lemma[^ these q (/5?i(u),j (-bridging paths can be extended 
into q paths from u to Pj{e) with at most 4 edges not following the sp-rules. The 
constructed paths contain no used nodes since the extended part of each path 
is entirely in the substar Moreover, no two of these q paths share a node 

that is not u, Ph(u), and Pj{s). 

We claim that at least one of these q extended paths contains no removed 
nodes. To the contrary, if each of these q extended paths contains at least one 
removed nodes, then the total number of removed nodes in the sets Sn[i], 

and Sn \j] is at least q + {q' — r) = n — r — 2 > n — 2r — 3 (recall that r is 
the number of paths that have been constructed by the algorithm so far. Thus, 
among the q' (p/j(m), j)-bridging paths that contain either removed nodes or used 
nodes, at least q' — r of them must contain at least one removed node each). 
This contradicts the fact that there are at most n — 2r — 3 removed nodes in the 
sets and Sn[j]- 

Thus, an extended path Q'f^j from u to Pj{e) in S~ without used nodes can 
be constructed. This path Q'f^j plus the edge [pj{e),e\ gives a path Pj from u to 
e with no used nodes and with at most 4 edges not following the sp-rules. Thus, 
the length of Pj is bounded by dist{u) + 8. Moreover, Pj can be constructed in 
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time 0{kjn + n) by tracing at most kj + 1 of the extended paths from u to Pj{s). 
Finally, this path is node-disjoint with all previously constructed paths since its 
extended part is entirely in Sn[j], which is not used by any other paths. 

Case 3.2. The node u is in the substar * yf 1. 

In this case, the node u has one neighbor in S'n,[l], and n — 2 neighbors in 
(see Figure [T]). Note that if the neighbor pi{u) of u in S'„[l] is in S~ , then 
a path from u to e via Pi{u) has been constructed in Step 1.2.2. Thus, we only 
need to consider the neighbors of u that are in [i] . 

Again we assume that the algorithm has constructed r paths from u to e by 
extending r bridging paths from u. Consider the (r -|- l)st pair {ph{u), Pj{e)). 

Since the n-star graph contains no cycle of length less than 6 |7|8J . two 
neighbors of u share no common neighbors except u. Let ui and U 2 be two 
neighbors of u in Since no two nodes in have the same neighbor in 

and no two nodes in have the same neighbor in Sn[j], a (ui,ji)- 

bridging path and a (u 2 ,j 2 (-bridging path share no common nodes except u for 
any ji and j 2 - Therefore, for the previous r paths from u to e constructed by 
the algorithm by extending bridging paths from u, none of them would intersect 
a (u', j)-bridging path. Thus, no (u', j)-bridging path contains used nodes. 

Thus, if there is a non-divergent (/9/j(u), j (-bridging path Phj without re- 
moved nodes, we can extend the path Phj, in the way of Case 3.1. A, into a path 
Pj from M to e such that the length of the path Pj is bounded by dist(u) + 8 
and the extended part of Pj is entirely in Sn[j]- On the other hand, if all non- 
divergent (p/i(u(,j (-bridging paths contain removed nodes, then, as in case 3.1.B, 
we can extend at least one divergent (p/i(u(, j(-bridging path from u into a path 
Pj from u to e such that the length of Pj is bounded by dist{u) + 8 and the 
extended part of Pj is entirely in Sn[j]- 

Now we discuss the running time of the algorithm M-connectivity. 

Each path is constructed by the algorithm by searching a proper path in 
a specific substar Sn[j], which takes time 0(kjn + n(, where kj is the number 
of removed nodes in the substar Sn[j]- No substar is used in extending more 
than one such a path. Therefore, the time complexity for constructing all these 
paths is bounded by 0 {k 2 n + k^n -I- • • • -I- -|- n(ji — 1(( = 0(n^( since by our 

assumption k 2 + k^ + ■ ■ ■ k„ < n — 3. Thus, the time complexity of the algorithm 
M-connectivity is bounded by 0(n^(. □ 

The following example shows that the bound on the path length in Theorem [T] 
is actually almost optimal. Consider the n-star graph Sn- Let the source node 
be n = (21(, here we have omitted the trivial cycles in the cycle structure. Then 
dist{u) = 1. Suppose that no neighbors of u and of e are removed. By Theorem[l] 
there are n— 1 node-disjoint paths in S~ from u fro e. Thus, for each i, S < i < n, 
the edge [u,Ui], where Ui = (*21(, leads to one Pi of these node-disjoint paths. 
Note that the edge [u,Ui] does not follow the sp-rules. Now suppose that the 
n — 3 nodes (i2( (1( are removed, for i = 3, 4, . . . , n — 1. Then the third node on 
the path Pi must be Vi = (jz21( for some j ^ l,2,i, and the edge [ui,Vi] does 
not follow the sp-rules. Since the only edge from Vi that follows the sp-rules is 
the edge [vi,Ui], the next edge [vi,Wi] on Pi again does not follow the sp-rules. 
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Now since all the first three edges on Pi do not follow the sp-rules, by Remark 
1, dist{wi) = dist{u) + 3 = 4, and the path Pi needs at least four more edges to 
reach e. That is, the length of the path Pi is at least 7 = dist{u) + 6. Thus, with 
n — 3 removed nodes, among the n — 1 node-disjoint paths from u to e, at least 
n — 3 of them must have length larger than or equal to dist(u) + 6. 

The situation above seems a little special since the distance dist{u) from u 
to e is very small. In fact, even for large distance nodes u, it is still possible to 
construct examples in which some of the node-disjoint paths connecting u and 
e must have length at least dist{u) + 6. We leave this to the interested readers. 



5 Conclusion 

Strong Menger-connectivity is a natural extension of the study of d-connectivity 
on d-regular graphs, and has direct applications in network fault tolerance and 
network parallel routing. Since one of the motivations of network parallel rout- 
ing is to provide alternative routing paths when failures occur, strong Menger- 
connectivity can also be regarded as the study of fault tolerance in networks 
with faults. In this paper, we have demonstrated that the recently proposed star 
graphs are strongly Menger-connected. We presented an algorithm of running 
time O(n^) that for two given non-faulty nodes u and v in the n-star graph 
Sn with at most n — 3 faulty nodes, constructs the maximum number (i.e., 
min{de( 7 /(u), de 5 /(n)}) of node-disjoint fault-free paths from u to v such that 
the length of the paths is bounded by dist{u, v) + 8. We showed that the length 
of the paths constructed by our algorithm is almost optimal. The time complex- 
ity of our algorithm is optimal since each path from u to v may have length as 
large as 0(n), and there can be as many as n — 1 node-disjoint paths from u 
to V. In fact, since the n-star graph Sn has n! nodes, our algorithm is actually 
“sub- logarithmic” in terms of the input graph size. 

We should mention that Rescigno m recently developed a randomized par- 
allel routing algorithm on star graphs with faults, based on the Information 
Disersal Algorithm (IDA) [16] . The algorithm in |17] is randomized thus it does 
not always guarantee the maximum number of node-disjoint paths. Moreover, in 
terms of the length of the constructed paths and running time of the algorithm, 
our algorithm seems also to have provided significant improvements. 

The study of strong Menger-connectivity shows another advantage of the 
star graphs over other popular network structures. In particular, the orthogonal 
partition of the star graphs |Z], which decomposes the n-star graph into n — 1 (n— 
l)-substars plus an independent set, seems very convenient for construction of 
node-disjoint paths: we basically can construct a path in each separated substar, 
ensuring that the constructed path is node-disjoint with other paths. Other 
popular network topologies, such as the hypercubes, do not seem to have this 
nice decomposition structure. For example, in the construction of node-disjoint 
paths in a hypercube with faults, we seemed not able to construct each path 
using a separated subcube, which has made the construction more involved [IS]. 
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Abstract. Finding a cut or finding a matching in a graph are so simple 
problems that they are hardly considered problems at all. In this paper, 
by means of a reduction from the NAE3SAT problem, we prove that 
combining these two problems together, i.e., finding a cut whose split 
edges are a matching is an NP-complete problem. It remains intractable 
even if we impose the graph to be simple (no multiple edges allowed) or 
its maximum degree to be fc, with fc > 4. On the contrary, we give a linear 
time algorithm that computes a matching-cut of a series-parallel graph. 
It’s open whether the problem is tractable or not for planar graphs. 



1 Introduction 

We consider two elementary graph-theoretic problems: finding a matching in a 
graph, i.e., a set of disjoint edges; and finding a cut in a graph, i.e., a set of edges 
whose removal disconnects the graph (see Figure[T](a,b)). These two problems can 
be separately solved in so a straightforward way that they are hardly considered 
problems at all: an arbitrary edge is a matching, and the set of edges incident 
to an arbitrary vertex is a cut. However, we show that combining these two 
problems together, that is, finding a cut that at the same time is a matching 
(see Figure [T{^, yields an NP-complete problem. 




(a) 




(b) 




(c) 



Fig. 1. (a) A matching, (b) A cut. (c) A matching cut. 



The problem remains intractable even if we impose the graph to be simple 
(no multiple edges allowed) or its maximum degree to be k, with fc > 4. On 
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the contrary, we give a linear time algorithm that computes a matching-cut of a 
series-parallel graph. It’s open whether the problem is tractable or not for planar 
graphs. 

This interesting combinatorial problem firstly arose in a practical application. 
In fact, in ||6I3| is described a new approach to three-dimensional orthogonal 
graph drawing, based on generating the final drawing through a sequence of 
steps. The method starts from a “degenerate” drawing, in which all vertices and 
edges are overlapped on the same point. At each step the drawing “splits” into 
two pieces and finds a structure more similar to its final version. The observation 
that a cut involving non adjacent edges yields a more efficient (in terms of time 
of computation) and effective (in terms of volume occupation, number of bends 
introduced, and average edge length) performance, stimulated the investigation 
on the complexity of finding such cut. 

The paper is organized as follows: in Section E] basic definitions are given; 
in Section 0 the problem of finding a matching-cut is demonstrated to be NP- 
complete. Section [H shows that the problem retains its complexity even if we 
impose the graph to be simple or its maximum degree to be k, with fc > 4. 
In Section [5] we produce a linear time algorithm that finds a matching-cut in a 
series-parallel graph. 



2 Preliminaries 

A graph G{V, E) consists of a set V of vertices and a set E of edges (u, v), such 
that u and v are vertices in V. 

An edge (u, v) is said to be incident to vertices u and v. Two edges are 
adjacent if they are incident to the same vertex v. The degree of a vertex v is the 
cardinality of the set of its incident edges. In a simple graph every edge incides 
on two distinct vertices and two edges share at most a vertex. A path from u to 
u is a sequence of edges ei, . . . , Cfc such that, e\ is incident to u, Ck is incident 
to V, and for every i = I, . . . , A: — I, is adjacent to e^+i. A graph is connected 
if a path exists between each pair of vertices. In the following we consider only 
connected graphs. 

A cut of a graph is a partition of its vertex set V into two non-empty sets. Note 
that a cut of a graph with n vertices can be associated with an array of n elements 
with values in {0, 1}, where all the elements with the same value correspond to 
the vertices of the same block of the partition. Since (0, 0, ... , 0) and (1,1,...,!) 
are forbidden configurations (they don’t correspond to partitions), and since 
there are two configurations corresponding to the same partition, it follows that 
there are 2"“^ — 1 distinct cuts. 

If the graph is connected, a cut can be alternatively characterized by speci- 
fying the set of edges connecting vertices that do not belong to the same block 
of the partition. Observe that one description of the cut can be easily obtained 
from the other in linear time. 

A matching in a graph G is a set E' Q E such that each pair of distinct edges 
Ci and 6j of E' are not adjacent. 
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The problem of finding a cut in a graph can be solved in linear time: it suffices 
a random labeling of its vertices with the two labels 0 and 1. Imposing the cut to 
be maximum the problem becomes NP-complete ( 0 ) , but approximable within 
1.1383 ([II). 

Finding a matching can be solved in 0(1) time: each single edge is a matching. 

Although finding a cut and finding a matching are simple problems, we expect 
the problem of finding a cut that is a matching to be a more difficult task. In 
the following section we will demonstrate that this problem is NP-complete. 

3 NP-Completeness of the Matching-Cut Problem 

Formally, the Matching-Cut problem is defined as follows: 

Problem: Matching-Cut 

Instance: A connected graph G{V,E). 

Question: Does a set E' C E, such that E' is a cut and a matching, 

exist? 

Given a cut, it’s easy to check in linear time if it is a matching by marking 
the extremes of each edge belonging to the cut. If no vertex is marked twice, 
the cut is also a matching. A non-deterministic Turing machine may generate 
all possible cuts and check in linear time if the cuts are matching-cuts. It follows 
that the matching cut problem is in NP. 

We will demonstrate that the Matching-Cut problem is NP-hard by means 
of a reduction from the NAE3SAT problem: 

Problem: Not-All-Equal-3-Sat (NAE3SAT) 

Instance: A set C of clauses, each containing 3 literals from a set of 

boolean variables. 

Question: Can truth values be assigned to the variables so that each 

clause contains at least one true literal and at least one false literal? 

The NAE3SAT problem was demonstrated to be NP-hard by Schaefer j^. 

Given a formula cj) in conjunctive normal form with v variables xi,. . . ,Xy 
and m clauses Ci, . . . ,Cm, with three literals each, we construct a graph G((()) 
that admits a matching-cut if and only if (f) is satisfiable within the NAE3SAT 
constraints. 

We start by building two chains of 2m -|- v double linked vertices as shown 
in Figure El 

In the following, the upper chain will be called FALSE-chain, and the lower 
one TRUE-chain. Since the vertices of each chain are linked with multiple edges, 
no matching-cut separates two vertices of the same chain, and each chain belongs 
to the same block of the partition that constitutes a matching-cut. We build 
the remaining part of the graph G{(j)) adding subgraphs connected both to the 
upper and to the lower chains in such a way that, if a matching-cut exists, it 
necessarily separates the two chains cutting through all the inserted subgraphs. 
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FALSE CHAIN 



TRUE CHAIN 
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^ 



Fig. 2. First step of the construction of G{4>): we create two chains of 2m + v double 
linked vertices. 



FALSE CHAIN 




FALSE CHAIN 




(a) (b) 

Fig. 3. The variable-gadget for variable Xi (a) and the only two possible matching-cuts 
(b). 



We call TRUE-set the block of the partition which the TRUE-chain belongs to, 
and FALSE-set the other. 

The subgraphs to be introduced are of two types: the variable-gadget and 
the clause-gadget. Figure[3]-a shows the variable-gadget. We introduce a variable- 
gadget for each boolean variable xi. The only way to cut the variable-gadget with 
a matching-cut is along the lines of Figure [3]-b. Each cut leaves the Xi vertex 
and the Xi vertex on the two opposite sets. Any other cut is not allowed, since 
it is not a matching. 

Figure |4] shows the clause-gadget. We introduce a clause-gadget for each 
clause of the (j) formula. The three black vertices of Figure |4] correspond to the 
literals li, Ij, and Ik of the clause {U V Ij V Ik)- There are six ways to cut through 
the clause-gadget with a matching-cut, as Figure El shows. Each cut corresponds 
to the truth assignment for the literals illustrated in Table [H 

Note that there is no matching-cut for the clause-gadget that leaves all the 
three vertices corresponding to the literals on the same side, and that there is 
no other matching-cut that could separate a subgraph of the clause-gadget. 

Finally, each vertex representing a literal k, Ij, or Ik of the clause-gadget is 
connected to the vertex representing the corresponding literal of the variable- 
gadget by means of two edges, in order to impose that they will belong to the 
same block of the partition determined by the matching-cut. Figure E] shows 
the whole construction for a formula with three boolean variables and a single 
clause. 
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FAT.SF. CHAIN 




TRUE CHAIN 



Fig. 4. The clause-gadget for clause {kV I j \/ Ik)- 



FAI,SF CHAIN 




FALSE CHAIN 




Fig. 5. The six matching-cuts of the clause-gadget of Fig. [4] 



Table 1. Truth assignment corresponding to the six cuts showed in Figure]^ 
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true 
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true 


true 


true 


no cut 
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Fig. 6 . Construction for the first three variables and a clause {Xi V X 2 V -^ 3 ). 



We claim that a matching-cut can be found in the graph G((/>) if and only if 
the corresponding instance (j) of the NAE3SAT problem has a solution. 

Suppose that such cut exists. Since the chains can not be cut and since 
no lesser part of the gadgets can be separated from the rest of the graph, the 
matching-cut has to separate the two chains, cutting through all the gadgets that 
tie them together. All the variable-gadgets are attached to the FALSE-chain and 
to the TRUE-chain, so the matching-cut must cut through their edges according 
to one of the two ways shown in Figure [3fb), determining a truth assignment 
of the boolean variables. Leaving a variable-vertex on a set implies that the 
corresponding literals in the clauses belong to the same set, and the negate 
literals belong to the other set. The clause-gadgets are also cut into two parts by 
the matching-cut, and necessarily according to one of the six ways illustrated in 
Figure El implying that at least one of the literal- vertices belongs to the TRUE- 
set and the other to the FALSE-set. So each clause, and the 4> formula, is satisfied 
as requested by the NAE3SAT problem. 

Conversely, if a truth assignment exists such that each clause has at least a 
true literal and a false one then a matching-cut can be constructed so that 

— it cuts each variable-gadget leaving the variable- vertex in the set correspond- 
ing to its boolean value and the negate literal is in the other set, 

— it cuts each clause-gadget using the cut corresponding to the given truth 
assignment of the literals as shown in Table [T| 

It’s easy to verify that the cut so obtained is a matching-cut. The following 
theorem is therefore demonstrated: 

Theorem 1. The Matching- Cut problem for a connected graph Gis NP-complete. 

4 Simple Graphs 

and Graphs with Maximum Degree Four 

The construction described in SectionElcan be easily modified in order to demon- 
strate that the problem retains its complexity even for simple graphs. Namely, 
we note that between a pair of vertices of the graph G{(p) there can be a single 
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o==o 





Fig. 7. (a) Replacing an edge with a path of two edges to eliminate double edges, (b) 
Replacing a vertex with a star-shaped graph to eliminate vertices with degree greater 
that 4. 



edge or two, and in the latter case we can replace one of the two edges with a 



two edges long path, as shown in Figure 7(a) 



It’s easy to see that the three vertices of the triangle are forced to be in the 
same set of the bipartition induced by a matching-cut. So, the following corollary 
is demonstrated: 



Corollary 1. The Matching-Cut problem for a simple connected graph G is NP- 
complete. 

We modify the same construction to demonstrate that the problem remains 
NP-complete even imposing a maximum degree k, with fc > 4 for the vertices 
of the graph. Observe that the vertices of the proposed construction have even 
degree. Also, edges are introduced two-by-two in order to impose that the double 
linked vertices will belong to the same set of the bipartition determined by a 
matching-cut. So, each vertex with degree greater than 4 can be replaced with 
the star construction of Figure [7(b) | in which every vertex has degree four. It’s 
easy to see that a matching-cut can not separate the vertices of the star-shaped 
subgraph of Figure |7(b)| 

Corollary 2. The Matching-Cut problem for a connected graph G (simple or 
not) of degree k, with k > 4 is NP-complete. 



5 Matching-Cuts of Series-Parallel Graphs 

In this section we describe a linear time algorithm that decides whether an 
SP-graph admits a matching-cut. Further, we modify the algorithm in order to 
return a matching-cut if exists. 

A graph G with no self-loops is series-parallel (or SP-graph) if two of its 
vertices s and t are selected to be respectively the source and the sink of G 
(denoted G(s, t)), and G can be built by recursively applying the following rules: 
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basic step graph G{s,t) consists of a single edge between s and t; 
serial composition graph G(s, t) is the union of two SP-graphs G„(s„, and 
Gd{sd, td) such that s = s„, tu = Sd, and t = td\ 
parallel composition graph G{s,t) is the union of two SP-graphs Gi{si,ti) 
and Gr{sr, G) such that s = si = and t = ti = G- 

Each SP-graph can be represented by a binary tree |H] in which nodes are of 
three types: Q, S, and P. Nodes types have the following meanings: 

— A Q-node represents a single edge 

— An S-node represents the serial composition of the graphs corresponding to 
its children, with the convention that the left child represents G„ and the 
right child represents Gd 

— A P-node represents the parallel compositions of the graphs corresponding 
to its children, with the convention that the left child represents G/ and the 
right child represents Gr 

Such tree is called parse tree. In order to guarantee the uniqueness of the 
parse tree for a given SP-graph, it is further required that, if a node has 
the same type of its parent, then is its right child. It is possible to recognize an 
SP-graph (and build its parse tree) in linear time [8]. 

We associate each node of the parse tree with an SP-graph. For S-nodes and 
P-nodes the associated SP-graph is obtained by applying the rule represented 
by the node to the SP-graphs associated with the children. For brevity, in the 
following we do not distinguish between the node of an parse tree and the graph 
associated with it. 

Our algorithm takes as input an SP-graph G{s,t) and decides if it admits 
a matching-cut. It is based on a post-order traversal of the parse tree. In the 
following we say that for a given SP-graph a matching-cut is st-separating if it 
leaves s and t in distinct sets. Also, we say that vertex s (t) is engaged by a 
st-separating matching-cut if one of the edges of the matching-cut is incident to 
s (t). 

We associate two labels ^i(G') and h{G') with each node G'{s,t) of the 
parse tree of G. Label hiG') signals if G' admits at least one non st-separating 
matching-cut {hiG') = 1) or not {hiG') = 0). Note that, if hiG') = 1 then for 
each ancestor G" of G' it must hold h{G") = 1. 

Label 1\{G') signals whether G' has at least one st-separating matching-cut 
and whether vertices s and t are necessarily engaged. The six possible values of 
label ^i(G') are listed below along with their meanings. 

0 There does not exist an st-separating matching-cut for such node. 
s ■ t There exists at least one st-separating matching-cut and in all matching- 
cuts both s and t are engaged. 

s There exists at least one st-separating matching-cut. In all matching-cuts s 
is engaged. Further, in some of them, t is not engaged. 

t There exists at least one st-separating matching-cut. In all matching-cuts t 
is engaged. Further, in some of them, s is not engaged. 
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s(Bt There exists at least one st-separating matching-cut that engages s and 
not t and there exists at least one st-separating matching-cut that engages t 
and not s. Further, no matching-cut that does not engage either s or t exist. 

1 There exists at least one st-separating matching-cut in which neither s nor 
t are engaged. 




Fig. 8. Examples of SP-graphs with their h labeling, (a) No matching-cut exists (Zi = 
0). (b) There exists at least one matching-cut engaging t only {h = t). (c) There exists 
at least one matching-cut engaging s only (Zi = s). (d) All matching-cuts engage both 
s and t (Zi = s ■ t). (e) There exists at least a matching-cut that engages s and not t 
and at least a matching-cut that engages t and not s (Zi = s © t). (f) There exists at 
least a matching-cut that does not engage neither s nor t {h = 1). 



The labels h and I 2 may be computed recursively for each node G, applying 
one of the following rules, depending on the type of the node: 



Q-node li{G) = s ■ t and hiG) = 0. 

S-node Let and Gd{sd,td) be the children of G. The value of h{G) 

is given in the table shown in Figure 9(b) which summarizes the following 
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(a) Parallel composition 



(b) Serial composition 



Fig. 9. Tables for computing h{G) from the labels of the children of G. 
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observations. If h{Gu) = h{Gd) = 0 then G does not have an st-separating 
matching-cut. In all other cases the conditions on = Sd are not propagated 
to h{G), while the conditions on Su and td are. Note that, since an st- 
separating matching-cut either is st-separating for Gu or is st-separating for 
Gd, it can never be h{G) = s ■ t. 

The label hiG) = 1 if and only if hiGu) = 1, or hiGd) = 1, or it is possible 
to find two st-separating matching-cuts, one in and one in Gd, such that 
tu = Sd is not engaged by both of them. Such observations are summarized 
by the table in Figure [TU] 
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Fig. 10. The table for computing hiG) from the labels of the children of G. 



P-node Let Gi and Gr be the children of G. The value of /i(G) is given in 
the table shown in Figure [9(a) which summarizes the following observations. 
If one between G/ and G^ does not admit an st-separating matching-cut, 
so does G. If G/ (Gr) admits an unconditioned st-separating matching-cut 
then li{G) = ^i(Gr) (^i(G) = li{Gi)). In all other cases G may have an 
st-separating matching-cut if and only if it is possible to find two matching- 
cuts, one in Gi and one in Gr, that do not engage the same vertex (s or t). 
Hence, in such cases, /i(G) = s ■ t. 

The label hiG) is 1 if and only if hiGi) = 1 or hiGr) = 1- 



Theorem 2. There exists a linear time algorithm to verify if an SP-graph ad- 
mits a matehing-cut. 

Proof, (sketch) The correctness of the algorithm follows from the correctness 
of the recursive computation of the labels h and h described above. For the 
time complexity, it suffices observing that the number of the nodes in the parse 
tree associated with an SP-graph of m edges is bounded by 2m, and therefore 
a post-order traversal can be performed in linear time. For each node G of the 
parse tree the values of hiG) and hiG) can be computed in constant time, since 
they depend only on the values of its two children. 

The algorithm can be easily modified as follows in order to return a matching- 
cut if it exists. Given the input SP-graph, we associate with each node G(s,t) 
of its parse tree three sets Si, S[, and 82 - Such sets are subsets of the edges of 
the input SP-graph and their are used as follows. If /i(G) = s ■ t then subset Si 
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is a matching-cut for G. If h{G) = s {h{G) = t) then subset is a matching- 
cut that engages s (t) only. If li{G) = s (B t then Si is a matching-cut that 
engages s only, and subset S[ is an alternative matching-cut that engages t only. 
If li{G) = I then subset Si is a matching-cut that engages neither s nor t. If 
h{G) = I then S2 is a non st-separating matching-cut. 

For each node G of the parse tree the three sets Si, S[, and S2 can be updated 
in the same post-order traversal that calculates the values of the labels li and I2 
by suitably combining the sets associated with the children of G. Since at each 
steps the sets of the children are not needed anymore, the involved set operations 
(union and copy) may be performed in constant time. It follows that the problem 
of finding a matching-cut in an SP-graph can be solved in polynomial (in fact, 
linear) time. 

6 Conclusions and Open Problems 

We studied the problem of finding a matching-cut in a graph, and showed that in 
the general case the problem is NP-complete, and that it retains its complexity 
even if the graph is simple or its maximum degree is bounded by a constant 
k> A. 

As for other classes of graphs, we described an algorithm to find a matching 
cut (if any) in a series-parallel graph in linear time, so demonstrating that the 
problem is in P for this particular class of graphs. 

It’s open whether the problem is tractable or not for planar graphs. 
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Abstract. In this paper we prove the NP-hardness of various recog- 
nition problems for subgraphs of De Bruijn graphs. In particular, the 
recognition of DNA graphs is shown to be NP-hard; DNA graphs are 
the vertex induced subgraphs of De Bruijn graphs over a four letter al- 
phabet. As a consequence, two open questions from a recent paper by 
Blazewicz, Hertz, Kobler & de Werra [Discrete Applied Mathematics 98, 
1999] are answered in the negative. 
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1 Introduction 

Blazewicz, Hertz, Kobler & de Werra introduced the concept of DNA graphs 
to model the computing and reconstruction phase of DNA chain sequencing by 
hybridization. For more information on the biological background, we refer the 
reader to Blazewicz, Hertz, Kobler & de Werra or to Bains & Smith [1]. 
The graph theoretical background is fairly easy to describe: Let G = (V, A) be 
a directed and simple graph that may contain loops. An (a, k) -labeling of G 
assigns a label L{v) = [G{v) , ^ 2 {v) , . . . ,(-k{v)] to every vertex v G V such that 

(LI) All entries ii{v) in the labels of all vertices v G V are from an alphabet of 
size a (e.g., from the set {!,..., a}). 

(L2) For u,v G V with v, L{u) ^ L{v). Thus, different vertices get different 
labels. 

(L3) For u,v G V, [£ 2 {u), ■ ■ ■ ,£k(u)] = [£i(u), . . . , £fc_i(u)] holds if and only if 
there is an arc (u, v) G A. In other words, an arc is encoded by the fact 
that the last k — 1 entries of the label of the tail-vertex are equal to the 
first fc — 1 entries of the label of the head-vertex. 

For integers a > 2 and A: > 1, we denote by the class of all directed simple 
graphs that possess an (a, fc)-labeling. The set U^i is denoted by and 
the set is denoted by Blazewicz, Hertz, Kobler & de Werra [3] 

call a graph G a DNA graph if and only if G G The four letters in the 
underlying alphabet correspond to the four nucleotide bases adenine (A), guanine 
(G), cytosine (G), and thymine (T). 



A. Brandstadt and V.B. Le (Eds.): WG 2001, LNCS 2204, pp. 296- 130^ 2001. 
(c) Springer- Verlag Berlin Heidelberg 2001 
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Fifty years ago and working in a somewhat different line of research, De 
Bruijn |4] studied the subwords of certain circular sequences. To this end he 
investigated the combinatorial structure of directed graphs whose vertex set 
consists of the possible words of length k over an alphabet of size a. Two 
vertices are connected by an arc if and only if the last k — 1 letters of tail-word 
are equal to the first k — 1 letters of the head-word. Such a graph is nowadays 
called a De Bruijn graph, and it is denoted by B{a, k). De Bruijn graphs are used 
in communication networks and in VLSI design; cf. e.g. Samathan & Pradhan 
|8]. It is straightforward to see that a graph is a member of the above defined 
class if and only if it is the vertex induced subgraph of the De Bruijn graph 
B{a, k) with word length k and alphabet size a. Moreover, a graph is a DNA 
graph if and only if it is the vertex induced subgraph of i?(4, k) for some k. 

In this paper we will study the computational complexity of the membership 
problems for the classes and for various values of a > 2 and fc > 1. 

Let us start our discussion with the classes 

— For k = 1, the membership problem for £“ is trivial: A directed simple graph 
G = {V, A) is in £“ if and only if A = V xV and \V\ < a. 

— For k = 2, Blazewicz, Hertz, Kobler & de Werra jS] design a polynomial time 
algorithm that decides for any input graph G and any input parameter a, 
whether G is a member of £2 • 

— For any fixed number fc > 3, we will prove in this paper (cf. Theorem 0) 
that it is NP-hard to decide for an input graph G and an input parameter 
a whether G € 

— For a = 2, the complexity of the membership problem for £^ is unknown. 
We conjecture that it is polynomially solvable. 

— For any fixed number a > 3, we will prove in this paper (cf. Theorem |S)) 
that it is NP-hard to decide for an input graph G and an input parameter k 
whether G € 

Note that if a and k both are not part of the input, then the membership problem 
for £^ is easy. In this case the size of class £^ is a fixed constant that does not 
depend on the input, and we simply may search through all of it. Next, we turn 
to the membership problems for the classes £(^ and £^ . Some of these problems 
are fairly close to the membership problems for the classes £^. 

— Blazewicz, Hertz, Kobler & de Werra give a polynomial time algorithm 
that takes a graph G and a parameter k as input, and correctly decides 
whether G is in £^. 

— For a = 2, the complexity of the membership problem for £^ is unknown. 

— For any fixed number a > 3, we will prove in this paper (cf. Theorem El 
that it is NP-hard to decide for an input graph G whether G G £(^. 

At the end of [3], the authors pose five open questions on the computational 
complexity of recognizing graphs in various classes £^. Question 1 considers a 
graph G with a given (a, fc)-labeling, and it asks to find the largest possible label 
length q such that G is in £^ for some appropriate value of /3. This question has 



298 Rudi Pendavingh, Petra Schuurman, and Gerhard J. Woeginger 

been answered by Blazewicz, Formanowicz, Kasprzak & Kobler 0 who design 
a polynomial time algorithm for it. Question 2 concerns the complexity of the 
membership problem in C^. This question remains open. Question 3 asks for 
a polynomial time algorithm for the following problem: Given an integer k and 
a graph G, find the smallest integer a such that G is in . Our Theorem 
answers this question in the negative; the problem is NP-hard. Question 4 asks 
for a polynomial time algorithm for the membership problem for Theorems 
and[ 8 | answer this question in the negative, and they show that the problem is 
NP-hard. Question 5 asks for a polynomial time algorithm for the following 
problem: Given a graph G with a given (a, fc)-labeling, determine all integers q 
such that G is in £“. This question remains open. 

Organization of the paper. In Section [2] we state some notation, we recall 
the definition of the graph coloring problem, and we derive some facts on De 
Bruijn graphs. Section O deals with the problem variants for fixed label lengths, 
and Section 0 ] deals with the problem variants for fixed sizes of the alphabet. 

2 Preliminaries 

Throughout the paper, labels will sometimes be considered as words over an 
appropriate alphabet, and then the entries of the labels will be considered as the 
letters of these words. For words wi and W2, we define in the usual way their 
concatenation rci • W2 that results from appending word W2 at the right end of 
word Wi- 

All our NP-hardness proofs will be done by reductions from the graph color- 
ing problem (cf. Garey & Johnson [S|)- This coloring problem remains NP-hard 
even if the color bound 7 > 3 is not part of the input. 

GRAPH GOLORING 

Input: An undirected graph H = (X,E) with |A| = n vertices, and a color 
bound S < ^ < n. 

Question: Does H have a feasible 7 -coloring, i.e., does there exist a function 
/ : X — >• { 1 ,..., 7 } such that /(x) yf f{y) for all edges (x, y) G El 

In the rest of this section, we investigate when a De Bruijn graph B{a, k) is a 
subgraph of another De Bruijn graph B{f3, h). This will lead to a gadget for our 
NP-hardness proof in Theorem O 

Lemma 1 Let a, (3 >2 and k,h > 2 be integers such that 2a > (3. If the De 
Bruijn graph B{a, k) is a subgraph of B(/3, h), then B{a, k — 1) is a subgraph of 
B{(3,h-1). 



Proof. To simplify presentation, we will sometimes identify vertices with their 
labels. Assume that B{a, k) is a subgraph of B{(3, h), and let / : {1, . . . , a}* —>■ 
{!,..., /3}^ denote the corresponding injection between the two label sets. Gon- 
sider an arbitrary label v of length k — 1 over {!,..., a}. There are a distinct 
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vertices in B{a,k) whose labels end with the k — 1 entries in r;; we denote this 
vertex set by Sy. Moreover, there are a distinct vertices in B{a, k) whose labels 
start with the k — 1 entries in v; we denote this vertex set by T„. Note that 
the sets Sy and Ty are not necessarily disjoint. However, for u v we have 
Sy r\ Su = 9 and fl = 0 . 

In the graph B(a,k) there is an arc from every vertex in Sy to every vertex 
in Ty. Now consider the vertices in f{Sy) = {/(s)|s G and in f(Ty) = 
G Ty}. Since B{a,k) is a subgraph of B{/3,h), all labels of vertices in 
f{Sy) must end with the same h — 1 entries [£ 2 , ■ . . ,ih] = w, and all labels of 
vertices in f(Ty) must start with the same h — 1 entries w. Summarizing, these 
observations define for every label v in a unique corresponding 

label w = g{v) in {1, . . . We claim 

(i) that the function g : {1, . . . , a}^~^ — >■ { 1 , . . . , is an injection, and 

(ii) that g certifies that B{a, /c — 1) is a subgraph of B(/3, h — 1). 

To see (i), suppose that for u,v G {1, . . . , with u v and for w G 

{1, . . . , we have g{u) = g{v) = w. As f is an injection, f{Sy U Sy) is 
a set of 2a labels, all with last h — 1 entries equal to w, thus with 2 a distinct 
first entries. This contradicts the assumption that 2a > /3. 

To see (ii), consider an arc from vertex u to vertex v in the graph B(a, k — 1). 
Then the label of u starts with an entry x followed by a sequence y oi k — 2 
entries, and the label of v starts with the sequence y followed by an entry z. 
Consider the vertex w in B{a, k) with label x - y ■ z, and let the label of f{w) be 
x' ■ y' ■ z' where x' and z' are single entries and where y' is a sequence oi h — 2 
entries. With this notation we have f{w) = f{x ■ y ■ z) = x' ■ y' ■ z' , and hence 
g{u) = g{x ■ y) = x' ■ y' and g{v) = g{y ■ z) = y' ■ z' . Consequently, there is also 
an arc in the graph B{/3, h — 1) going from vertex g(u) to vertex g{v). □ 

Theorem 2 Let a, /3 > 2 and k,h > 1 be integers sueh that 2a > j3. If the De 
Bruijn graph B{a, k) is a subgraph of the De Bruijn graph B{f), h), then h = k. 

Proof. Omitted in this extended abstract. □ 

We note that since B{2,2) is a subgraph of the complete directed graph on 
4 vertices B{4, 1), the requirement that 2a > /3 in the statement of Theorem E] 
is necessary. 

3 The Problem Variant with a Fixed Label Length 

Let H = (X,E) and 7 be an arbitrary instance of GRAPH COLORING. Let 
xi, . . . ,Xn be an enumeration of all vertices in X and let ei, . . . , em be an enu- 
meration of all edges in E. Without loss of generality we assume that H does 
not contain any isolated vertices. We will now construct in polynomial time a 
directed graph G = {V, A) and an alphabet size a, such that G is in Cf if and 
only if H is 7 -colorable. 

— For every vertex Xi G X with 1 < i < n, the graph G contains two cor- 
responding vertices v{xf) and v'{xi). There is a loop at every vertex v(xi). 
From every vertex v(xi) with 1 < i < n there is an outgoing arc to v'(xi). 



300 Rudi Pendavingh, Petra Schuurman, and Gerhard J. Woeginger 



— For all 1 < i ^ j < n, there is a directed path with three arcs from v{xi) to 
v{xj) through the two intermediate vertices Vi{xi,Xj) and V 2 {xi,Xj). From 
every vertex V 2 {xi,Xj) there is an outgoing arc to v'{xj), and an outgoing 
arc to every vertex v\{xj,Xk) with 1 < fc < n and k ^ j. 

— For every edge Cg & E with 1 < s < to, the graph G contains a corresponding 
vertex v{es). There is a loop at every v(es). 

— For all 1 < s yf t < TO, there is a directed path with three arcs from v(eg) 
to v(et) through the two intermediate vertices Vi(eg, et) and U2(es, Ct). From 
every vertex ^2(65, e*) there is an outgoing arc to every vertex fi(et, e„) with 
1 < M < TO and u ^ t. 

— If vertex Xi is incident to edge in H, then G contains a directed path 
from v'{xi) to v{es) via vertices vi{xi,ea) and V 2 {xi,es). From every vertex 
V2{xi,es) there is an outgoing arc to every vi(eg,et) with 1 < t < m and 
t ^ s. 

This completes the construction of graph G = {V,A). Finally, we define the 
alphabet size a = n + to + 7. To simplify the presentation of the following argu- 
ments, we assume that the a letters in the alphabet are the vertices xi, . . . ,cc„ 
and the edges ei, . . . , Cm in i7, together with 7 colors ci, . . . , c-y. 

Lemma 3 //G = (V, A) has an (a, 3) -labeling, then the graph H is ^-colorable. 

Proof. Consider an {a, 3)-labeling of G. It is easy to see that the label of a vertex 
with a loop must consist of three identical letters. Without loss of generality we 
assume that every vertex v(xi) is labeled by [xi,Xi,Xi] and that every vertex 
v{ea) is labeled by [e^, 65,65]. From this we derive that vertex vi{xi,Xj) has 
label [xi,Xi,Xj], and that vertex 62(64,65) has label [64,65,65]. 

Now consider vertex v'{xi). Since it is a successor of v{xi), its label must 
be of the form [xi, Xi, a] where cr is some letter from the alphabet. The letter a 
can not be equal to Xi, since then v'(xi) and v(xi) would have the same label 
in contradiction to property (L2). The letter a can also not be equal to xj with 
i yf j, since then vertices v'(xi) and Vi{xi,Xj) would have the same label. Since 
Xi is not an isolated vertex in H, there is a path from v'{xi) to some vertex 
6(65) through the intermediate vertices v\{xi,es) and 62(3:4, 65). Then the label 
of vertex 61(3:4,65) equals [xi,cr, 65], and the label of vertex 62(0:4,65) equals 
[(7,65,65]. We conclude that a can neither be equal to 65 (since then 62(0:4,65) 
and 6(65) had the same label) nor can it be equal to some 64 with t yf s (since 
then 62(^4, 65) and 62(04, 65) had the same label). The only remaining possibility 
for (T is that it is one of the colors ci, . . . , c.y. 

We now define f{xi) = Ui for 1 < i < n, where [3:4, 3:4, CT4] is the label of vertex 
v'(xi). By the above discussion, every (T4 is one of the colors ci, . . . , c-y. We claim 
that this yields a feasible coloring. Indeed, suppose that f{xi) = f{xj) = a 
where Xi and xj are connected to each other by an edge 65 in H. Then the 
label of 62(3:4, 65) equals [cr, 65, 65], and the label 62(3:^-, 65) also equals [cr, Cg, 65], 
a contradiction to property (L2) . □ 
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Lemma 4 If the graph H is ^-colorable, then G = (V,A) has an {a, S) -labeling. 

Proof. Consider a feasible 7-coloring of H that assigns to every vertex Xi G X a 
color (Ti from the colors ci, . . . , c-y. We define the following labeling: 

— For 1 < f < n, the label of v{xi) is [xi,Xi,Xi] and the label of v'(xi) is 

[Xi , Xi , CTi\ . 

— For all 1 < z j < n, the label of v\{xi,Xj) is [xi,Xi,Xj] and the label of 

V2{Xi,Xj) is [Xi,Xj,Xj]. 

— For 1 < s < TO, the label of v{es) is [e^, Cg, 6s]. 

— For all 1 < s t < TO, the label of ?;i(es,et) is [es,eg,et] and the label of 
V2{es,et) is [cg, 64,64]. 

— If vertex Xi is incident to edge 6g, then the label of vi{xi,Cs) is [2:4, tJi, 6g], 
and the label of V2{xi, 6g) is [0-4, 6g, 6gj. 

Clearly, this labeling assigns distinct labels to distinct vertices. To verify that the 
labeling also fulfills property (L 3 ), we divide the vertices of G into five classes: 
The first class Vi contains all vertices v(xi), vi(xi, Xj), and V2{xi, xj); the entries 
in the labels of all these vertices are from Xi, . . . ,Xn. The second class V2 contains 
all vertices v'{xi); the first two entries in the labels of these vertices are from 
xi, . . . , Xn, and the last entry is from Ci, . . . , c-y. The third class V3 contains all 
vertices Vi{xi,es); their labels consist of some vertex Xi, followed by a color, 
followed by an edge. The fourth class V4 contains all vertices V2{xi,es)', their 
labels consist of some color, followed by two edges. The fifth class V5 contains 
all vertices v{es), vi{es,et), and V2{es,et); their labels are triples of edges. 

By the definition of the labeling, there can only be arcs within Vi, arcs from 
Vi to V2, arcs from V2 to V3, arcs from V3 to V4, arcs from V4 to V5, and arcs 
within V5. It is now straightforward but somewhat tedious to verify that all 
arcs within Vi and within V5, from Vi to V2, and from V4 to V5 are correctly 
encoded. Moreover, every vertex Xi has a unique color (T4, and therefore every 
label [xi, CT4, 6g] of a vertex in V3 only allows a unique predecessor and a unique 
successor. □ 

Combining the statements of Lemma E] and |H now yields the statement of The- 
orem E] below for fc = 3 . The construction is easily extended to the cases with a 
fixed fc > 4 . The main idea is to replace in our construction all the connecting 
paths with three arcs by new connecting paths with k arcs. The details are left 
to the reader. 

Theorem 5 For any fixed k > 5 the following problem is NP-hard: Decide for 
an input graph G and an input parameter a whether G € 

4 The Problem Variant with a Fixed Alphabet Size 

Let H = (X, E) and 7 be an arbitrary instance of GRAPH COLORING. By 
adding at most |if| — 1 independent edges to H, we can make \E\ a perfect 
power of two without increasing the chromatic number of H. Then by adding 
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some isolated vertices (again without increasing the chromatic number of H), 
we can make |X| = \E\. Summarizing this yields that we may assume without 
loss of generality that n = \X\ = \E\ = 2^ holds for some integer z > 4. 

Let xi, . . . ,Xn be an enumeration of all vertices in X and let ei, . . . , e„ be 
an enumeration of all edges in E. We will now construct in polynomial time a 
directed graph G = (V, A) and a label length k, such that G is in C]. if and only 
if E[ is 7 -colorable. 

— For every vertex Xi G X with 1 < i < n, the graph G contains a correspond- 
ing complete binary out-tree T{xi) of height z. The root of this tree is the 
vertex v(xi). Every interior vertex in T(xi) has a left and a right out-going 
arc that connect it to its two children. All 2^ leaves in T(xi) are at the same 
distance z from the root. Enumerating the leaves in T{xi) from left to right, 
they are called v{xi, ei),v{xi, 62 ), ■ • ■ , v{xi, e„). 

— For every edge Cg G E with 1 < s < n, the graph G contains a corresponding 
vertex v{es). 

— If vertex Xi is incident to edge in El, then G contains a directed path 
P{xi,e.s) from v{xi,es) to u(cs). The path P{xi,es) consists of z -I- 2 arcs 
and of z-|- 1 new vertices. The interior vertices on this path are not connected 
to any other parts of graph G. 

This completes the construction of graph G = {V, A). We define the label length 
k = 2z -t- 3. 

To simplify the presentation of the following arguments, we assume that the 
alphabet E consists of the letters 0,1 ,..., 7 — 1. Some of the labels will be 
concatenated from the binary representations of certain integers. For an integer 
i in the range 1 < t < 2 ^, we define bin(z) to be the z-digit binary representation 
of i — 1. This representation always contains an appropriate number of leading 
zeroes so that its length exactly equals z. For non-negative integers i and for 
letters a G E, we will use ct* to represent the word consisting of exactly i letters 
a. 

Lemma Q If G = {V,A) has a {'y,k) -labeling, then the graph H is ^-colorable. 

Proof. Consider a ( 7 , /c)-labeling of G. For any vertex Xi G X, we define its color 
f{xi) to be the (2z-|-2)th digit of the label of the root v{xi). Since the alphabet 
size is 7 , this coloring uses only 7 colors. We claim that this yields a feasible 
coloring. 

Indeed, suppose that f{xi) = f{xj) = a where Xi and Xj are connected to 
each other by an edge Cg in H. We consider the predecessors of v{eg) on the 
paths P{xi,€s) and P{xj,eg), and call these vertices w{xi,eg) and w(xj,eg), 
respectively. There is a directed path of 2z -I- 1 arcs from v(xi) to w(xi,eg). As 
a consequence of property (L3), the (2z -I- 2)th digit of the label of v(xi) must 
be equal to the first digit of the label of w(xi, 6 g). Hence, this first digit equals 
cr. By analogous reasoning we get that the first digit of the label of w(xj,eg) 
also equals a. Since w(xi,eg) and w(xj,eg) both are predecessors of v(eg), the 
last k — 1 digits of their labels must agree with the first fc — 1 digits of the label 
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of v(es). But now the labels of w{xi,es) and w{xj,es) agree in the first digit 
and also in the last k — 1 digits, and hence they are equal to each other. This 
contradicts property (L2). □ 

Lemma 7 If the graph H is ^-colorable, then G = {V,A) has a (j,k) -labeling. 

Proof. Consider a feasible y-coloring of H that assigns to every vertex Xi G X a, 
color CTi from X. We define the following partial labeling: 

— For 1 < i < n, the label of v{xi) is ■ 0 • bin(z) • 2 • • 1. 

— For 1 < s < n, the label of v{es) is 1 • bin(s) • 2^+^. 

All other vertices are on some path with 2z + 2 arcs from some root v{xi) to 
some v{es). If a vertex v is j arcs away from v{xi) and 22; + 2 — j arcs away from 
v{es), then its first 2z + 3 — j letters are the last 2z-\-H — j letters of the label of 
v{xi), and its last j + 1 letters are the first j + 1 letters of the label of v{es). This 
completely determines the label of v. Summarizing, the label of every vertex on 
path P{xi, 6s) is an appropriate subword of length 2z + 3 of the word 

2^~^ ■ 0 • BiN(t) • 2 • (Ti • 1 • bin(s) • 2^+^. 

For an illustration, the reader may want to verify that the label of the leaf vertex 
v{xi, 6s) in T{xi) equals bin(i) • 2 • • 1 • bin(s). Now we state some simple but 

important observations on these labels. 

First: By considering a label L(v), one can easily determine whether the 
corresponding vertex v is in one of the trees T{xi), or lies on one of the paths 
P{xi,es), or is one of the vertices v{es). Indeed, if a label L{v) has a 0 or a 1 
as last digit, then the vertex v is contained in some tree T{xi). And if the label 
L{v) ends with a 2, then the vertex v lies on one of the paths P{xi, eg), or it is 
one of the vertices u(es); and v is one of the vertices v{eg) if and only if its label 
L{v) ends with a block of 2’s of length 2: + 2. 

Second: If a vertex v is contained in some tree T{xi), then its label L{v) 
starts with a block of 2’s that is followed by a 0 that in turn is followed by z 
binary digits that uniquely identify the index i. For a non-leaf vertex v, the only 
possible labels of successors of v result by removing the first digit from L{v) and 
by appending a 0 or a 1 at its right end; this correctly encodes the tree structure 
of T{xi). For the leaves v{xi,es), there is at most one possible successor label 
that results by removing the first digit (0 or 1) and by appending a 2. Such a 
successor vertex exists if and only if Xi is incident to Cg. Hence, also the initial 
arcs of the paths P{xi, Cg) are correctly encoded by the labeling. 

Finally: Let us consider a vertex v that lies on one of the paths P{xi, Cg). Its 
label L(v) ends with a non-empty block of 2’s that is preceded by z binary digits 
that uniquely identify the index s. These z binary digits are preceded by a digit 
1, which in turn is preceded by the digit ai. Since the edge Cg is incident to two 
vertices Xi and Xj with different colors in the 7-coloring, we have ai ^ a j. Hence, 
the index s together with the letter ai uniquely identifies the path on which the 
vertex v lies. The only possible label for a successor results by removing the first 
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digit, and by appending a 2. Hence, all the arcs on all the paths P{xi,es) are 
correctly encoded. □ 

Combining the statements of Lemma and [7| now yields that there exists a 7 - 
coloring for H if and only if there exists a ( 7 , fc)-labeling for G. Since 7 -coloring 
remains NP-hard for every fixed 7 > 3, we get the following theorem. 

Theorem 8 For any fixed a > 3 the following problem is NP-hard: Decide for 
an input graph G and an input parameter k whether G G . 

In fact our reduction also proves that it is hard to find labelings whose alphabet 
size is close to optimal, since all the inapproximability results from graph coloring 
(cf. eg. H^tad [B] and Khanna, Linial & Safra 0) immediately carry over to the 
labeling problem. E.g. since it is NP-hard to find a 4-coloring for a 3-colorable 
graph 0, it is NP-hard to find a (4, A:)-labeling for a graph G in Since (unless 
P=NP) there is no constant factor approximation algorithm for graph coloring 
IQ, there also is no constant factor approximation algorithm for minimizing the 
alphabet size a under a given label length k. 

Theorem 9 For any fixed a > 3 the following problem is NP-hard: Decide for 
an input graph G whether G S 

Proof. Let H = {X, E) be an arbitrary instance of GRAPH 7 -COLORING; 
throughout the proof we assume that 7 > 3 is a fixed constant. We repeat the 
polynomial time construction of above: We make n = |A| = \E\ = 2^ for some 
integer z, and we construct a directed graph G = (P, A), such that G is in £ 22 - 1-3 
if and only if the undirected graph Ed is 7 -colorable. Finally, we define the graph 
G' = iy' , A') to be the disjoint union of this graph G and of the De Bruijn graph 
5(7 — 1, 2z -I- 3). The De Bruijn graph 5(7 — 1, 2z -I- 3) has 

(^_lj 2 z +3 < ^ 21 ogn -|-3 ^ ^3^21og7 

vertices. Hence, the size of G' is polynomial in the size oi H = (X,E), and the 
whole construction can be done in polynomial time. 

We claim that G' is in ^ ™ ^ 2 z-\- 3 - First, assume that 

G' is in £2o- Then G' is a vertex induced subgraph of R( 7 , k) for some k. Since 
on the other hand B{'y — l,2z -\- 3) is a subgraph of G', Theorem [2] now yields 
that fc = 2z -I- 3 must hold. Therefore, the vertex induced subgraph G of G' 
indeed is in £22+3- Next, assume that G is in £22+3- Then we can reuse the 
( 7 , 2 z-|- 3)-labeling that we defined in Lemmadfor G. Moreover, the De Bruijn 
graph 5(7 — 1, 2z -I- 3) can be ( 7 , 2z -\- 3)-labeled in the natural way by all 
words of length 2z -|- 3 over the alphabet {0, 1, . . . , 7 — 1} \ {2}. In the labels for 
vertices in G, every substring of length z -|- 3 contains at least once the digit 2. 
In the labels for vertices in B{'j — l,2z -I- 3), the digit 2 does not show up at 
all. Gonsequently, these labelings do not generate any cross-edges between the 
graphs G and 5(7 — 1, 2z -I- 3), and they together yield a ( 7 , 2z -I- 3)-labeling for 
G'. 

If we combine these statements with the statements in Lemma Eland 0 we 
get that G' is in if and only if the graph H is 7 -colorable. □ 
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Corollary 10 It is NP-hard to decide whether an input graph G is a DNA 
graph. □ 
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Abstract. Let tt be any fixed polynomial-time testable, non-trivial, 
hereditary property of graphs. Suppose that the vertices of a graph G 
are not necessarily linearly ordered but partially ordered, where we think 
of this partial order as a collection of (possibly exponentially many) lin- 
ear orders in the natural way. We prove that the problem of deciding 
whether a lexicographically first maximal subgraph of G satisfying tt, 
with respect to one of these linear orders, contains a specified vertex is 
NP-complete. 



1 Introduction 

Miyano [B] proved that the problem of computing the lexicographically first max- 
imal subgraph of a given graph, where this subgraph should satisfy some fixed 
polynomial-time testable, non-trivial, hereditary property tt, is P-hard (even 
when the given graph is restricted to be either bipartite or planar and tt is non- 
trivial on the class of bipartite or planar graphs, respectively). Because of the 
stipulations on tt, the lexicographically first maximal subgraph satisfying the 
property tt can be computed by a generic greedy algorithm. Note that Miyano’s 
result is widely applicable; to any polynomial-time testable, non-trivial, heredi- 
tary property tt, such as whether a graph is planar, bipartite, acyclic, of bounded 
degree, an interval graph, chordal, and so on. Miyano states that his work was 
inspired by that of Asano and Hirata [1], Lewis and Yannakakis |5], Watanabe, 
Ae and Nakamura [7] and Yannakakis |8] on node- and edge-deletion problems 
in NP. Typical in this work is the result of Lewis and Yannakakis that the 
problem of finding the minimum number of nodes needing to be deleted from a 
graph so that the graph satisifes a fixed polynomial-time testable, non-trivial, 
hereditary property tt is NP-hard. 

Of course, a tacit assumption in Miyano’s work is that the vertices of any 
graph are linearly ordered. In this paper, inspired by Miyano’s results, we re- 
turn to the setting of NP in that we consider computing lexicographically first 
maximal subgraphs of given graphs, where these subgraphs should satisfy some 
given polynomial-time testable, non-trivial, hereditary property tt, except that 
now the graphs come equipped with not just one linear ordering of their ver- 
tices but several. Hence, for a given graph we will be involved with a collection 
of lexicographically first maximal subgraphs and not just one. Note that if we 
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gave our linear orderings explicitly then a graph on n vertices could only come 
with a polynomial (in n) number of such linear orderings (as otherwise it would 
be unreasonable to define that the whole input has size n) and we would still 
be working within P. In order to work with an exponential number of linear 
orderings, we present our collection of linear orderings in the form of a partial 
order, j. e., an acyclic digraph, with a source vertex providing the (common) least 
element of any of the linear orderings. As in Miyano’s deterministic scenario, a 
non-deterministic polynomial-time greedy algorithm computes all lexicographi- 
cally first maximal subgraphs. 

Our main result is that the problem of deciding whether a lexicographically 
first maximal subgraph of a given partially ordered graph, where this subgraph 
should satisfy some fixed polynomial-time testable, non-trivial, hereditary prop- 
erty 7T, contains some specified vertex is NP-complete (even when the given 
graph is restricted to be planar bipartite and tt is non-trivial on this class of 
graphs). We use similar techniques to Miyano although our proofs are compar- 
atively simpler and the combinatorics is very different. 

It is not at all obvious as to how we might use Miyano’s result for P- 
completeness directly to prove an analogous result for NP-completeness but 
with partial orderings replacing linear orderings (indeed, we have failed with 
this approach and have had to revert back to ‘first principles’). Furthermore, 
the NP-completeness results from | I1IRI7I8| can not be formulated in our frame- 
work. For example, the result of Lewis and Yannakakis |3, mentioned above, is 
concerned with sizes of maximal subgraphs satisfying a specific property and as 
such is unrelated to our problems. 

2 Basic Definitions 

For standard graph-theoretic definitions, the reader is referred to |2]. A property 
7T on graphs is hereditary if whenever we have a graph with the property tt, the 
deletion of any vertex and its incident edges does not produce a graph violating 
TT, i. e., TT is preserved by induced subgraphs. A property tt is called non-trivial 
on a class of graphs if there are infinitely many graphs from this class satisfying 
7T but TT is not satisfied by all graphs of the class. 

Let TT be some property of graphs. Let G be a graph, let H he & partial 
order of the vertices of G, and let s and t be vertices of G. We assume that the 
partial order H is given in the form of an acyclic digraph detailing the immediate 
predecessors, i.e., the parents, and the immediate successors, i.e., the children, 
of each vertex. We think of a partial order H as encoding a collection of linear 
orders of the form s = sq, s\, S 2 , ■ ■ ■ , Sk, where s^+i is a child of Sj and Sk has no 
children. Note that a partial order can encode an exponential number of linear 
orders. 

The algorithm GREEDY(7 t) is as follows: 

input 

S := 0 

current -vert ex := s 
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(*) if TT(SU{current-vertex} ,G) then 
S := SU{current-vertex} 
fi 

while current -vert ex has at least one child in H do 
current -vert ex := a child of current-vertex in H 
(**) if TT(S[j{current-vertex} ,G) then 
S := SU{current-vertex} 
fi 
od 

output (S) 

where tt{S U {current-vertex} , G) is a predicate evaluating to ‘true’ if, and only 
if, the subgraph of G induced by the vertices of S' U {current-vertex} satisfies tt. 
Essentially, GREEDY (tt) non-deterministically chooses a linear order from those 
encoded in the partial order and performs a greedy execution on G with respect 
to the property tt and the chosen linear order. We say that a vertex v is the 
current-vertex if we have ‘frozen’ an execution of the algorithm GREEDY(7 t) 
immediately prior to executing either line (*) or line (*+) and the value of the 
variable current-vertex at this point is v. Note that in general the algorithm 
GREEDY(7 t) is non-deterministic and produces a collection of sets of vertices as 
outputs. If the property tt is hereditary and can be checked in polynomial-time 
then the algorithm GREEDY (tt) non-deterministically computes, in polynomial- 
time, the lexicographically first maximal subgraphs of the graph G satisfying tt 
with respect to the linear orders encoded within the partial order H . 

Let C be a class of graphs and let tt be some property of graphs. The problem 
GREEDY (partial order, C, tt) has: as its instances tuples (G, H, s, t), where G is 
a graph from C, H is a, partial order of the vertices of G and s and t are vertices 
of G; and as its yes-instances those instances for which there exists an execution 
of the algorithm GREEDY (tt) on input (G, H, s) resulting in the vertex t being 
output. The problem GREEDY (linear order, C, tt) is defined similarly. Miyano’s 
result from can be stated as follows. 

Theorem 1. Let tt be a polynomial-time testable, non-trivial, hereditary prop- 
erty on the class of graphs C, where C is the class of all graphs, the class of planar 
graphs or the class of bipartite graphs. Then the problem GREEDY {linear order, 
C, tt) is P-eomplete. □ 



3 Our Results 

In order to prove our main result, we need to first establish a completeness result 
for the specific problem GREEDY(partial order, planar bipartite, independent 
set) (we only sketch the proof due to space limitations). 

Theorem 2. The problem GREEDY{partial order, planar bipartite, indepen- 
dent set) is NP -complete. 
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Proof. (Sketch) We reduce from the known NP-complete problem Directed 
Hamiltonian Path (DHP): whose instances are triples (G,s,t), where G is a 
digraph and s and t are vertices of G; and whose yes-instances are instances for 
which there is a Hamiltonian path in G from s to t (see m- 

Let (G = (V,E),s,t) be an instance of DHP of size n. W.l.o.g. we assume 
that \V\ > 2, that the vertex set of G is {l,2,...,n} and that s = 1 and 
t = n. Corresponding to this instance, we build an instance {G' , H' , s' ,t') of 
GREEDY (partial order, planar bipartite, independent set). The vertex set V of 
G' and H' is 



: z,j = 2, 3, ...,n- 1} U {x, s', t'}. 

The edges of G' are 

: t, j = 2,3, . . . ,n - 1} 

CKrCjj-, Ui_^i j) : t = 2, 3, . . . , n 2; J = 2, 3, . . . , n 1} 

Zj) j = 2,3, . . . ,n 1} 

U{{zj,Wn-i,j+i) : j = 2, 3, . . . , n - 2} 

U{{Zn-l,t')} 

and the edges of H' are 

: i = 2,3,.. .,n- 2; j, j' = 2, 3, . . . , n - 1; {j,f) G E} 
U{(s',U 2 ,j) : j = 2,3, . .. ,n- 1; (1, j) G E} 

U{{vn-ij,x) : j = 2,3, ...,n-l;(j,n) G E} 

U{(a;,U2.2)} 

, Wi^j ) . i — 2,3,.. .,?r 2, j — 2,3,. ..,?r 1} 

(rCj j' , ) . i — 2,3,.. .,?r 2, j — 2,3,.. .,?r 1} 

U{{un-i,j,U 2 ,j+i : j = 2,3, . . . ,n - 2} 

(Uyi— 1 , Wn—1.2)\ 

U{{wn-i,j,Zj) : j = 2,3, . . . ,n - 1} 

i,j+i) ■ J = 2, 3, . . . , n. 2} 

U{(z„_i,t')} 

The construction of the instance {G' , H' , s' ,t') is illustrated in Figs. ?? and 
E] which depict: a digraph G; the resulting graph G'; and the resulting partial 
order E[' , respectively. We omit the proof that (G, s,t) is a yes-instance of DHP 
if, and only if, (G', iJ', s', t') is a yes-instance of GREEDY (partial order, planar 
bipartite, independent set). □ 
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Now we consider the problem GREEDY(partial order, planar bipartite, tt) 
where tt is any polynomial-time testable, non-trivial, hereditary property. We 
begin with some specific graph-theoretic definitions. 

A cut-point of a connected graph G is a vertex c such that its removal (along 
with its incident edges) from G results in a graph with at least 2 connected 
components. A component relative to a cut-point c is a subgraph consisting of c, 
one of the derived connected components and all those edges of G joining c and 
a vertex of the component. If a connected graph does not have any cut-points 
then it is biconnected . 

Let a = (oi, 02, . . . , Ug) and b = (6i, 62, . . . , 6t) be two tuples of positive 
integers. We order these tuples lexicographically as follows. We say that a >L b 
if either: 

• there exists some i G {1, 2, . . . , min{s, t}} such that aj = bj, for all j G 
{1, 2, . . . , i — 1}, and Ui > bi] or 

• s > t and Gj = bj, for all j G { 1 , 2 , . . . ,t}. 
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Fig. 3. The partial order H' corresponding to G. 



The a-sequence ac of a connected graph G is defined as follows. Suppose 
that G is not biconnected. If c is a cut-point of G whose removal results in a 
graph with k connected components then define ac,c = where 

ni > U 2 > ■ ■ ■ > rik are the numbers of vertices in the components relative to 
c. We define ac to be the lexicographically-minimal tuple of the (non-empty) 
set {oc.G : c is a cut-point of G}, and we define cq to be any cut-point for 
which ac = ctca,G- If G is biconnected then removing any vertex will result 
in one connected component relative to the removed vertex, and so we define 
ac = (|G|) and cq as any vertex. 

Given a graph G with connected components Gi, G 2 , . . . , Gk, the f3-sequence 
Pg of G is defined as (acnOGa) • ■ • >«Gfe), where aci >l oGs >l->l aCt- A 
/9-sequence is therefore a tuple of tuples of integers. 

Theorem 3. Let tt be a property satisfying the following eonditions: 

• 7T is non-trivial on planar bipartite graphs; 

• TT is hereditary on indueed subgraphs; 

• TT is satisfied by all sets of independent edges; and 

• TT is polynomial-time testable. 

The problem GREEDY{partial order, planar bipartite, tt) is 1S!P -complete. 
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Proof. For brevity, we refer to the problem GREEDY (partial order, planar bipar- 
tite, 7 t) as Q. The property tt is, by assumption, non-trivial on planar bipartite 
graphs. It follows that amongst all planar bipartite graphs violating tt, there 
must be (at least) one with smallest /3-sequence, where /3-sequences are ordered 
lexicographically and where the comparison of components, i.e., a-sequences, is 
according to >l- Let us call such a graph J; that is, 

(3j = min{/3G : G is a planar bipartite graph violating tt}. 

Let Ji, J 2 , . . . , Jfc be the connected components of J ordered according to aj^ >l 
C(J 2 ctj^. It follows that J has /3-sequence /3j = (oj,^ , aj^, . . . , ojj.). 

Let c = cjj and let the connected components of Ji relative to c be /q U {c}, Ii U 
{c}, ... ,/m U {c}, where |/o| > |/i| > . ■ . > |/m|- Denote by /, the subgraph of 
Ji induced by the vertices of {c} U I\ U . . . U Im- As tt is non-trivial on planar 
bipartite graphs and it is hereditary on induced subgraphs, it follows that tt is 
satisfied by any independent set of vertices, and so Iq must contain at least one 
edge (otherwise J would be a set of independent vertices) . 

To prove the NP-completeness of the problem Q, we reduce from the problem 
GREEDY(partial order, planar bipartite, independent set), which, for brevity, 
we call PL (it was proven NP-complete in Theorem [5|). That is, from an instance 
(G, H, s, t) of PL, we create an instance (G', H' , s' , t') of Q. 

We will divide the construction of G' from G into three phases. For any 
subset of vertices U of J, we denote by {U) the subgraph of J induced by the 
vertices of U. Note that as (Jq U {c}) contains at least one edge and is connected, 
there exists a vertex d of /q U {c} such that (c, d) is an edge of (/q U {c}). 

Phase 1 For each vertex u of G, we attach a copy of (/, U {c}) by identifying 
u with c (all such copies are disjoint). Gall the resulting graph G. Note that 
the vertex set of G consists of the vertices of G, which we call the G-vertices, 
together with disjoint copies of the vertices of /». As both (/») and G are planar 
and bipartite, G maintains these properties. 

Phase 2 We replace each edge {u,v) of G, where u and v are G-vertices, by 
a copy of {Iq U {c}) by identifying u with c and v with d (all such copies are 
disjoint). Note that our choice of d results in the graph so formed being planar 
and bipartite. 

Phase 3 We add disjoint copies of J 2 , J 3 , ■ ■ ■ , Jk to obtain G', which is clearly 
planar and bipartite. 

The partial order H' consists of a linear order onto which is concatenated the 
partial order H (of the G-vertices) . The linear order consists of: all vertices of G' 
that are vertices of some copy of (/q); followed by all vertices in the copies of (/*); 
followed by all vertices of J 2 , J 3 , ■ ■ ■ , Jk- It does not matter how we order the 
vertices of some copy of (/q), for example, in the linear order. We concatenate 
this linear order prior to H by including an edge from the last vertex of the 
linear order to the vertex s of H. Denote the vertex s' to be the first vertex of 
the above linear order, and denote the vertex t' to be the G- vertex of G' formerly 
known as t. Our construction can be visualised in Fig. [d] 
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We will now state three lemmas to be used in the remainder of the proof. 



Lemma 1. Any graph K consisting of any number of disjoint copies of (/q \ 
{d}) plus any number of disjoint copies of (/*) plus a disjoint copy of each of 
J2, J3, . ■ . ,Jk satisfies tt. 
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Proof. The connected components of K consist of J2, J3, ■ ■ ■ , Jt together with 
the connected components of the copies of (/g \ {d}) and (/*). Consider the a- 
sequence a of a connected component of either (/g\ {d}) or (J»). All components 
of a are strictly less than |Jo| + 1; and so a is strictly less than aj^. Hence, (3k 
has one less component equal to than (3j, with all other components strictly 
less than and so K satisfies tt by minimality oi (3 j. □ 



Lemma 2. Take a single copy of (J, U {c}) and any number of disjoint copies 
of ((/g \ {d}) U {c}), and identify the vertices named c in all of these graphs. 
Then the resulting graph M satisfies tt. 

Proof. Let M' be the connected component of M containing c. We begin by 
remarking that any other connected component of M has an a-sequence strictly 
less than the a-sequence (|/o| -I- 1); and so strictly less than aj^. 

Suppose that c is a cut-point of M' . Then Oc,m' has components |/i|-|-l, I/ 2 I + 
1, . . . , |/m| -I- 1 as well as possibly some other components which are all strictly 
less than |Jo| -|- 1. Hence, by arguing similarly to as in the proof of Lemma [T] 
aM’ is strictly less than aj^. By the remark above, (3m is strictly less than (3j 
and so M satisfies tt by the minimality of (3j. 

Suppose that c is not a cut-point of M' . Then J, = Ji, i.e., m = 1, and 
M' = (/,); hence, om' is at most (|/i| -|- 1). Any connected component of M 
different from M' has size at most |/o|— 2, and so = (|/g|-|-l, |/i|-l-l) is strictly 
greater than the a-sequence of any connected component of M . Consequently, 
(3m is strictly less than (3j; and M satisfies tt by the minimality of (3 j. □ 



Lemma 3. Any graph N consisting of disjoint copies of .J 2 , J 3 , . . . ^ Jk plus any 
number of disjoint copies of the graph M from Lemma d satisfies tt. 

Proof. By the proof of Lemma |2] the graph M is such that the maximal com- 
ponent of (3m is strictly less than aj^. By reasoning as we did in the proof of 
Lemma [H it follows that (3n is strictly less than (3j and so N satisfies tt by the 
minimality of /3j. □ 

Throughout, we refer to a G-vertex in G' and the corresponding vertex in G 
by the same name (and also to a vertex of H and the corresponding vertex in 
the portion of the partial order H' corresponding to H by the same name). 

Consider the algorithm GREEDY(Tr) on input {G' , H' , s' ,t'). The partial 
order H' consists of a linear order, whose vertices are Sq, say, concatenated 
with the partial order H. The subgraph of G' induced by the vertices of Sq is 
as is the graph K of Lemma [1] and consequently every vertex of Sq is always 
placed in every output from GREEDY (tt). Note that the algorithm GREEDY (tt) 
on input {G' , H' , s' ,t') with current-vertex s is working with exactly the same 
partial order, namely H, as is the algorithm GREEDY (independent set) on input 
(G, H, s, t) with current- vertex s. 

Suppose, as our induction hypothesis, that: 
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• the algorithm GREEDY (independent set) on input (G,H,s,t) has current- 
vertex It, for some ancestor u of s in H, and has so far output the set of 
vertices S] 

• the algorithm GREEDY(7 t) on input {G' , H' , s' ,f) has current vertex u in 
H' and has so far output the set of vertices S'o U 5'; and 

• the subgraph of G' induced by the vertices of So U S' is in the form of a 
subgraph of the graph N in Lemma O 

Note that the induction hypothesis clearly holds, in the base case, when the 
vertex u is actually s. 

Suppose that the algorithm GREEDY(7 t) outputs the vertex u. If u is such 
that adding u to S'o U S' completes a copy of Iq then we would have a copy 
of J within the subgraph of G' induced by the vertices of S'o U S' U {«}. This 
would yield a contradiction because this subgraph satisfies tt (by definition), tt 
is hereditary on induced subgraphs, and J would then have to satisfy tt. Hence, 
the vertex u is not joined to any vertex of S in G and so u is output by the 
algorithm GREEDY (independent set). 

Gonversely, if the algorithm GREEDY (independent set) outputs u then this 
is because S U {m} is an independent set in G; and consequently So U S U {u} 
induces in G' a subgraph of the form of a subgraph of the graph N in Lemma |3] 
Hence, by Lemma u is output by the algorithm GREEDY(7 t). 

By induction, we obtain that if S is a set of vertices output by the algorithm 
GREEDY (independent set) on input {G,H,s,t) then So U S is output by the 
algorithm GREEDY(7 t) on input {G' , H' , s' ,t'), and conversely. Hence, we have 
a log-space reduction from TL to Q. □ 

Just as Miyano did in j^, we can actually remove the necessity in Theorem |3] 
for 7T to be satisfied by all sets of independent edges. Ramsey theory can be 
applied to show that any graph property that is non-trivial and hereditary on a 
class of graphs is either satisfied by all independent sets or by all cliques. We can 
then use this fact to eliminate the need for tt to be satisfied by all independent 
edges (we omit the details here but point out that we proceed exactly as Miyano 
did). Hence, we obtain the following. 

Corollary 1. Let tt be a polynomial-time testable, hereditary graph property that 
is non-trivial on planar bipartite graphs. The problem GREEDY{partial order, 
planar bipartite, tt) is complete for ISSP . □ 

We conclude by remarking that in the full version of this paper, we extend 
Gorollary [T]so that it applies to directed graphs and we examine degree bounds 
on graphs so as to delineate when a problem GREEDY (partial order, graphs, tt) 
becomes solvable in polynomial-time (for specific properties tt). 
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Abstract. We call an edge e of a perfect graph G critical if G — e 
is imperfect and say further that e is anticritical with respect to the 
complementary graph G. We ask in which perfect graphs critical and 
anticritical edges occur and how to find critical and anticritical edges 
in perfect graphs. Finally, we study whether we can order the edges of 
certain perfect graphs such that deleting all the edges yields a sequence 
of perfect graphs ending up with a stable set. 



1 Introduction 

The present paper is devoted to the investigation of critical and anticritical 
edges with respect to perfectness, a rich and well-studied graph property. Berge 
proposed to call a graph G = {V, E) perfect if, for each of its induced subgraphs 
G' C G, the chromatic number x(G') equals the clique number w(G'). I.e., for 
every induced subgraph G' of a perfect graph G we need as many stable sets to 
cover all nodes of G' as a maximum clique of G' has nodes (a set V' G V is a. 
clique (stable set) of G if the nodes in V are pairwise (non-)adjacent). 

Berge [T] conjectured two characterizations of perfect graphs. His first con- 
jecture was that a graph G is perfect iff the clique covering number x(G') equals 
the stability number a(G') VG' C G (i.e., that we need as many cliques to cover 
all nodes of G' as a maximum stable set of G' has nodes). Since complementa- 
tion transforms stable sets into cliques and colorings into clique coverings, we 
have a(G) = w(G) and x(G') = x(G') where G denotes the complement of G. 
Hence, Berge conjectured that a graph G is perfect if and only if its complement 
G is. This was proven by Lovasz m and is nowadays known as Perfect Graph 
Theorem. 

The second Berge conjecture concerns a characterization of perfect graphs via 
forbidden subgraphs. It is a simple observation that chordless odd cycles G 2 fc+i 
with k > 2, termed odd holes, and their complements G 2 fc+i, called odd antiholes, 
are imperfect. Clearly, each graph containing an odd hole or an odd antihole as 
induced subgraph is imperfect as well. Berge conjectured in [Tj that a graph is 
perfect if and only if it contains neither odd holes nor odd antiholes as induced 
subgraphs; such graphs are nowadays called Berge graphs. This still open Strong 
Perfect Graph Conjecture has already been verified for several classes of i^-free 
Berge graphs, e.g., if F is a claw (Parthasarathy and Ravindra [IS], see Fig. Ha)), 
a diamond (Tucker |T2], see Fig. Hb)), a clique of size 4 (Tucker [IB]), or a 
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bull (Chvatal and Sbihi |4], see Fig. [TJc)). Padberg [15] introduced the notion 
of minimally imperfect graphs: imperfect graphs with the property that remov- 
ing any of its nodes yields a perfect graph. Using this term, the Strong Perfect 
Graph Conjecture reads that odd holes and odd antiholes are the only minimally 
imperfect graphs. Therefore, minimally imperfect Berge graphs are termed mon- 
sters. In order to verify or falsify the Strong Perfect Graph Conjecture, many 
structural properties of minimally imperfect graphs have been discovered. To 
mention only a few of them, the following node pairs must not occur in mini- 
mally imperfect graphs G. It is well-known that G has no comparable pair (two 
nodes x and y with N{x) — yQ N(y), i.e., all neighbors of x except eventually y 
belong to the neighborhood of y). G does not admit twins {antitwins), i.e., two 
nodes x and y such that all remaining nodes of G are adjacent to both or to none 
of X and y (to either x or to y) due to the Replacement Lemma j1 1 J {Antitwin 
Lemma m)- Note that the property of being a comparable pair, twins, or an- 
titwins does not depend on whether or not x and y are adjacent. Furthermore, 
no minimally imperfect graph G contains an even pair (two nodes x and y such 
that all chordless paths connecting x and y have even length) by the Even Pair 
Lemma na and a star-cutset (a cutset S containing one node that is adjacent 
to all remaining nodes of S) by the Star-Cutset Lemma [3]- 




(a) (b) (c) 



Fig.El Some special graphs: (a) claw, (b) diamond, (c) bull. 



Although several, in general AfV-haxd, combinatorial optimization problems 
can be solved in polynomial time for perfect graphs j^, the structure of perfect 
graphs is not well understood. The recognition problem for perfect graphs is 
unsolved and, in particular, the Strong Perfect Graph Gonjecture still seems to be 
out of reach. On the other hand, the investigation of minimally imperfect graphs 
has revealed that these graphs have quite strong properties. This motivated us 
to consider new extremal graphs with respect to perfectness and lead to the 
concept of critical and anticritical edges in perfect graphs. The starting point of 
our investigation was the following graph evolution process. Imagine you have 
an arbitrary perfect graph and you consecutively delete one edge until you get 
a stable set, or you consecutively add one edge until a clique is reached. So 
you create a sequence of graphs starting and ending up with a perfect graph. 
But, if you choose the edges to be deleted or added randomly, most graphs of 
your sequence will be imperfect . The aim is to avoid the occurrence of imperfect 
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graphs in our sequence and, thereby, to give some information which perfect 
graphs may be constructed by consecutively deleting or adding edges. 

Problem 1. Given a certain perfect graph, is there a rule how to choose an edge 
to be deleted or added in order to keep perfectness? 

We call an edge e of a perfect graph G critical if G — e is imperfect. Moreover, 
we say that e is an anticritical edge of the complement G: e ^ E(G) holds 
and G + e is imperfect due to the Perfect Graph Theorem m- Whenever we 
delete (add) a critical (anticritical) edge e of a perfect graph G, we create in 
G — e (G + e) minimally imperfect subgraphs. In order to attack Problem[T], we 
study in Sect. |2] those subgraphs Gg Q G which yield the minimally imperfect 
subgraphs Ge — e Q G — e (Ge + e C G + e). One example of a critical edge 
is a single short chord of a cycle with odd length > 5 (that is a chord whose 
endnodes have distance two on the cycle), the deletion of e yields an odd hole. 
Hence, one rule in the sense of Problem [T| is “never omit a single short chord of 
an odd cycle” . Analogously, adding the edge between the endnodes of a chordless 
path P 2 k+i with k >2 yields an odd hole, too. “Never add an edge connecting 
the endnodes of a chordless path of even length > 4” is, therefore, another rule 
in the sense of Problem [I] Further results of that type are presented in Sect. 
Sect.[3]investigates whether it is easier to solve Problem[T]if the graph in question 
belongs to a certain subclass of perfect graphs. Gertain perfect graphs may not 
possess any critical or anticritical edge, then we could choose an arbitrary edge 
keeping perfectness. 

Problem 2. In which perfect graphs do critical and anticritical edges occur at 
all? 



One answer gives a new characterization of Mcynicl graphs. They have been 
introduced in [12] as graphs all odd cycles of length > 5 of which admit at 
least two chords. So critical edges in form of single short chords of odd cycles 
do obviously not occur in Meyniel graphs. Actually, removing any edge from a 
Meyniel graph keeps perfectness by [9] and we show: a graph is Meyniel if and 
only if it docs not admit any critical edge. In particular, every perfect but non- 
Meyniel graph admits at least one critical edge. We study in Sect. [3] for such 
classes of perfect graphs which minimally imperfect subgraphs may occur after 
deleting (adding) a critical (anticritical) edge. The large abundance of classes of 
perfect graphs led us to mention only results for some “classical” classes: Strongly 
perfect graphs have been introduced in [2] to be graphs all of whose subgraphs 
G' Q G admit a stable set that has a non-empty intersection with all maximal 
cliques of G' . Weakly triangulated graphs are defined to have neither holes Gk 
nor antiholes Gk with k > 5 as induced subgraphs [Bj. Meyniel m called a 
graph G strict quasi parity if each of its non-complete subgraphs has an even 
pair and quasi parity if G' or G owns an even pair for each subgraph G' C G. 
In Sect, m we treat the following problem and present several classes of perfect 
graphs for which the answer is in the affirmative. 
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Problem 3. For a certain perfect graph, is there an order of all the edges to be 
deleted (added) so that we get a sequence of perfect graphs ending up with a 
stable set (clique)? 

It turns out that it does not suffice to identify non-critical or non-anticritical 
edges: we can certainly remove an arbitrary edge from a Meyniel graph keeping 
perfectness but, at present, we do not know anything about critical edges of the 
resulting graph. Thus we have to look for edges the deletion or addition of which 
preserves membership within the corresponding subclass of perfect graphs. By 
this way, we present the studied ordering of edges to be deleted or added for, 
e.g., weakly triangulated graphs. 

2 Critical and Anticritical Edges 

For every critical (anticritical) edge e of a perfect graph G there is necessarily at 
least one subgraph Gg Q G such that Gg — e (Gg + e) is minimally imperfect. We 
study those subgraphs Gg in order to give an answer to Problem [TJ If Ge — e C 
G — e is an odd hole, then Gg is isomorphic to an odd cycle of length > 5 which 
admits precisely one chord, namely e. Moreover, e has to be a short chord of 
this cycle, forming a triangle with two of its edges, since Gg must not contain 
an odd hole (the case Ge — e = G 7 is depicted in Fig.[^a) with e = xy). Thus, 
we have immediately: 

Proposition 1. IfGg — e is an odd hole G 2 k+i with k>2, then uj{Gg) = 3 and 
a{Gg) = k holds. Gg contains an even hole G 2 k- ^ bull and the complement of 
a claw and of a diamond appear in Gg if k > 3. 

In the complementary graph G, e is an anticritical edge and Gg + e an odd 
antihole (see Fig. |2l)b)). Clearly, the complementary statement of Proposition [1] 
is true for Gg. 

Proposition 2. IfGg + e is an odd antihole G 2 k+i with k>2, then w(Gg) = k 
and a{Gg) = 3 holds. Gg contains an even antihole G 2 k- A bull, a claw, and a 
diamond appear in Gg if k > 3. 







Fig. HI The cases |Ge| = 7 with critical or anticritical edge e = xy. 
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If Ge — e C G — e is an odd antihole, then Ge is the complement of an 
induced path P 2 k+i with k >2 (the case Gg — e = G 7 is depicted in Fig. [^c) 
with e = xy) . We have, therefore: 

Proposition 3. If Ge — e is an odd antihole C 2 k+i with k >2, then co(Ge) = 
k + 1 and a(Gg) = 2 holds. Ge is isomorphic to P 2 k+i- ^ diamond appears in 
Ge if k > 3. 

In the complement G, e is an anticritical edge, Gg + e an odd hole, and Gg 
an induced path (see Fig. EJd)). The complementary assertion of Proposition |3] 
is true for Gg. 

Proposition 4. //Gg + e is an odd hole C 2 k+i with k>2, then uj{Ge) = 2 and 
a(Ge) = k+1 holds. Ge is isomorphic to P 2 k+\. The complement of a diamond 
appears in Ge if k > S. 

Of course, such a description cannot be given if Gg — e or Gg + e is supposed 
to be a monster and, if the Strong Perfect Graph Conjecture is true, this case 
does not occur at all. With look at Problem [T] we list some properties of Gg if 
e is a critical edge and Gg — e a generally minimal imperfect graph. 

Lemma 1. Let e be a critical edge of a perfect graph G and Ge — e G G — e be 
minimally imperfect. 

(i) The endnodes of e belong to the intersection of all maximum cliques of Ge. 

(ii) The endnodes of e occur in a triangle of Ge. 

(iii) An even hole is running through e in Ge. 

(iv) If Ge — e is a monster, then e belongs to a and to a diamond of Ge. 

Proof. Consider a critical edge e of a perfect graph G and let Gg — e C G — e be 
minimally imperfect. We know o;(Gg) = x(G'g) but w(Gg — e) < x(Gg — e). Since 
the removal of an edge cannot increase the chromatic number, w(Gg) > w(Gg — e) 
follows. That means deleting e destroys all maximum cliques of Gg and (i) is true. 
In particular, w(Gg) = 2 and o;(Gg — e) = 1 follows from (i) if the endnodes of 
e do not possess any common neighbor in Gg. Hence the endnodes of e occur in 
a triangle of Gg and (ii) is shown. 

Due to the Even Pair Lemma m there is an odd induced path between 
every two non-adjacent nodes of Gg — e, in particular, between the endnodes of 
e. Thus an even hole is running through e in Gg and we obtain (iii). 

In the special case if Gg — e is supposed to be a monster, we need w(Gg— e) > 4 
due to a result of Tucker [18]. Condition (i) implies w(Ge — e) < 4 if e does not 
belong to a in Ge. To complete the proof of (iv), consider a common neighbor 
z of the endnodes x and y of e which exists by (ii). The nodes x, z, y induce a P 3 
in Gg — e, hence a hole G is running through x, z, y in Gg — e due to a result of 
Hoang [To] . If Gg — e is a monster, G has to be an even hole of length 4 (otherwise 
Gg — e or Gg contain an odd hole). □ 

As immediate consequence, the complementary results hold for Gg if e is 
anticritical. 
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Lemma 2. Suppose e = xy to be a critieal edge of the perfeet graph G. 

(i) The nodes x and y belong to a stable set of size 3 in G. 

(ii) G admits an even antihole eontaining x and y. 

(iii) If e is M-anticritieal, then x and y appear in the eomplement of a diamond 
and are eontained in a stable set of size 5. 

Lemma[TJii),(iii) and Lemma[21Ji),(ii) can be seen as rules which we ask for in 
Problem[T] given a perfect graph, how to identify a non-critical or non-anticritical 
edge? We are able to give further conditions when an edge of a perfect graph G 
cannot be critical. 

Lemma 3. Suppose e = xy to be a critical edge of a perfect graph G. Then x 
and y do not form twins, antitwins, or a comparable pair of G. Neither x nor y 
is a simplicial node in G (that is a node with a clique as neighborhood). 

Proof. Consider a perfect graph G, a critical edge e = xy of G, and a subgraph 
Ge Q G such that Ge — e is minimally imperfect. Then x and y do not form a 
comparable pair, twins, and antitwins of G, else they would also be a comparable 
pair, twins, and antitwins in Ge — e leading to a contradiction to a well-known 
fact, the Replacement Lemma and the Antitwin Lemma |14| . respectively. 
Now, neither x nor y can be a simplicial node, since then x and y are either 
twins or a comparable pair of Ge — e. □ 

If two nodes form twins, antitwins, or a comparable pair in G, they do so in 
the complementary graph G. In addition, x and y form a 2-pair if all induced 
paths connecting x and y have length two. We can also show that x and y must 
not form a 2-pair if e ^ E{G) is supposed to be an anticritical edge of G. 

Lemma 4. Let e = xy be an anticritical edge of the perfect graph G. Then x 
and y are neither twins, antitwins, a comparable pair, nor a 2-pair of G. 

Proof. Consider a perfect graph G, an anticritical edge e = xy ^ E{G), and 
Ge Q G such that Ge-l-e C G— e is minimally imperfect. Then Lemma[^ implies 
that X and y does not form a comparable pair, twins, and antitwins of G. Now 
assume x and y to form a 2-pair. In the case Nc„ (x) — Nc„ (y) — 0 (all neighbors 
of X in Ge belong to the neighborhood of y), x and y would be a comparable 
pair in Ge -\- e, hence there is a node x' € Nc^{x) — Nc^fy). Since x and y are 
a 2-pair in Ge and x' and y are non-adjacent, all paths in Ge -I- e connecting 
x' and y must contain a; or a node in Ng^{x) fl Nc^iy) in contradiction to the 
Star-Cutset Lemma [2]. □ 

3 Which Graph Classes Admit Critical 
or Anticritical Edges? 

This section is devoted to the investigation of Problem |21 In which perfect graphs 
do critical or anticritical edges occur at all? More precisely, are there classes of 
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perfect graphs so that no graph in this class has a critical or anticritical edge? 
For some graph classes, this follows immediately using the results established 
in the previous section. E.g., every critical edge is contained in a triangle by 
Lemma l^ii), so bipartite graphs do not possess any critical edge (since they con- 
tain no odd cycles). Lemma[3liii) says that every critical edge has to occur in an 
even hole. Consequently, no triangulated graph has a critical edge (since trian- 
gulated graphs do not admit any hole of length > 4) . Both properties together 
yield the existence of an odd cycle of length > 5 in the union of the triangle and 
the even hole. So line-perfect graphs cannot admit critical edges, too (since they 
are defined to contain no odd cycles of length > 5) . 

The next theorem characterizes the perfect graphs without any critical edge. 
It also yields a new characterization of Meyniel graphs, defined to contain no 
odd cycles of length > 5 with at most one chord. The proof of this theorem 
relies on the perfectness of a superclass of Meyniel graphs. Let G = {V, E) be a 
Meyniel graph and V C V. The slim graph G{V) is obtained by deleting every 
edge of G with both endnodes in V and G{V') is perfect due to Hertz [H]. 

Theorem 1. (Hougardy, Wagler) A perfect graph does not admit any critical 
edge if and only if it is Meyniel. 

Proof. Let G be a perfect graph without any critical edge. Assume G to be not a 
Meyniel graph then G admits an odd cycle G with length > 5 which has at most 
one chord. Since G is perfect, G cannot be chordless, hence G possess precisely 
one chord e. If e were not a short chord of G, an odd hole would be contained 
in G. Thus e has to be a short chord of G and so it is an H-critical edge of G in 
contradiction to the precondition. 

On the other hand, suppose G = (V,E) to be a Meyniel graph. Then, for 
every subset V' C V, the graph G{V') generated from G by deleting every edge 
of G with both endnodes in V is a slim graph, still perfect by Hertz [2]. Thus, 
if we choose an arbitrary pair of adjacent nodes in a Meyniel graph and delete 
the edge connecting them, we get a perfect graph again, i.e., a Meyniel graph 
cannot admit any critical edge. □ 

Making use of this new characterization of Meyniel graphs, a perfect graph 
does not admit any anticritical edge if and only if it is co-Meyniel, the comple- 
ment of a Meyniel graph. Hence we obtain immediately: 

Corollary 1. A perfect graph does neither admit any critical nor anticritical 
edge if and only if it is both Meyniel and co-Meyniel. 

Consequently, critical (anticritical) edges occur in all perfect graphs G not 
contained in the class of Meyniel (co-Meyniel) graphs. Our aim is to find out 
which minimally imperfect subgraphs may occur in G — e (G -I- e). The results 
listed in the next lemma are obtained by combining the knowledge on forbid- 
den subgraphs in the respective graph classes and the results from the previous 
section. 
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Lemma 5. Let G be a perfeet graph, e G E(G) be a critieal edge of G, and 
e! ^ E{G) be an anticritical edge of G. 

(i) If G is diamond-free Berge, then G — e and G + e' do not eontain a G 2 k+i 
with k > 3 by Proposition 0 and 0, Furthermore, G — e eannot admit any 
monster by LemmaW(iv). 

(ii) For every K/^-free Berge graph G, Proposition and Lemma\J](iv) imply 
that G — e has odd holes as the only minimally imperfeet subgraphs. G + e' 
does not eontain any C 2 k+i with k > 4 by Proposition^ 

(iii) Is G weakly triangulated or bull-free Berge, then G — e and G -\- e' cannot 
eontain C 2 k+i o.nd G 2 /c+i with k > 3 by Proposition^^ and\^ respectively. 

(iv) Is G strict quasi parity, a slim graph, or strongly perfect, then no even 
antihole G 2 k with k > 3 appears in G. Hence, G + e does not admit a 
C* 2 fc+i with k > 3 by Proposition]^ 

4 Perfect and Co-perfect Edge Orders 

After considering the occurrence of critical edges in several classes of perfect 
graphs, we turn to Problem E] we are interested whether it is possible, for graphs 
in a certain class C of perfect graphs, to successively delete or add edges keeping 
perfectness until a stable set or a clique is reached. Moreover, the existence of 
such edge orders for all graphs in C would provide a constructive method to 
generate the graphs in C by consecutively deleting or adding edges. For that, we 
use knowledge from the previous sections. 

Let G = (V, E) be a perfect graph. We call a numbering (ei, . . . , Cm) of its 
edge set E a perfect edge order if, for G = Gq, all graphs Gi := Gi_i — are 

perfect for 1 < i < m. Clearly, has to be a non-critical edge of Gi_i for 
1 < i < m, and Gm is a stable set. Analogously, we say that a perfect graph 
G admits a co-perfect edge order iff its complement G has a perfect edge order. 
Here we simply use the numbering of the edges of G for the non-edges of G and 
get finally a clique. 

Note that it does not suffice to identify non-critical or non-anticritical edges 
in the perfect graphs in question. E.g., we can certainly delete an arbitrary edge 
of a Meyniel graph keeping perfectness by 0, but we may obtain a slim graph 
that is not Meyniel and do not know anything about its non-critical edges so far. 
Hence, we cannot provide perfect edge orders of Meyniel graphs, although the 
graphs in this class are even characterized that they do not contain any critical 
edge due to Theorem 0 So we mainly have to look for edges such that their 
deletion or addition preserves the membership to the corresponding subclass of 
perfect graphs. 

In general, we have to look for critical (anticritical) graphs with respect to 
the subclass C of perfect graphs under consideration: that are graphs which lose 
the studied property by deleting (adding) an arbitrary edge. If we can ensure 
that no (anti) critical graph with respect to C exists, then we know there is a 
(co-)perfect edge order for all graphs in C: every graph G in C admits at least 
one edge e such that G — e (G-\-e) still belongs to C. It is “only” left to find that 
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edge e. In order to answer the question whether or not Meyniel graphs admit a 
perfect edge order we have, therefore, to solve the following problem: 

Problem 4- Find a critical Meyniel graph or show that no such graph exists. 

Looking for graph classes without critical graphs, we first observe that there 
are no critical graphs with respect to every monotone class (that is a class 
defined by some forbidden partial subgraphs). The simplest example is the class 
of bipartite graphs having no odd cycles. Obviously, deleting an arbitrary edge of 
a bipartite graph yields a bipartite graph again. Hence, bipartite graphs admit 
perfect edge orders and, in particular, every of their edge orders is perfect. A 
superclass of bipartite graphs consists of all line-perfect graphs having no odd 
cycles of length > 5 as forbidden partial subgraphs. Again, there are no critical 
line-perfect graphs, hence every line-perfect graph admits a perfect edge order. 
Moreover, we have that they are precisely those perfect graphs such that every 
of their edge orders is perfect. 

Theorem 2. A graph is line-perfect iff all edge orders are perfect. 

Proof. A line-perfect graph G does not contain any odd cycle of length > 5 as 
partial subgraph. Obviously, G — e does also not contain any odd cycle of length 
> 5 Ve G E{G) and, therefore, is still line-perfect. Thus every ordering of E{G) 
is perfect. 

Now, assume G to be perfect but not line-perfect. Then G admits a cycle G 
of odd length at least 5 as partial subgraph. G cannot be chordless and every 
edge order of G that deletes all chords of C before an edge of G is not perfect. □ 

Bipartite and line-perfect graphs are subclasses of Meyniel graphs. A further 
class of Meyniel graphs for which we know a perfect edge order consists off all 
triangulated graphs. For those graphs, we have a well-known structural result, 
namely, that a graph is triangulated iff every subgraph has a simplicial node. 
Consider a triangulated graph G = {V, E) and let x be a simplicial node of G. 
Then no edge e incident to x is critical by Lemma The graph G — e is not 
only still perfect, but even still triangulated, since x is also simplicial in G — e. 
As a consequence, we obtain a perfect edge order for triangulated graphs: 

Theorem 3. Every triangulated graph G admits a perfect edge order (ei, . . . , Cm) 
with G = Go, Gi = Gi-i — ei, and ei incident to a simplicial node of Gi-\ for 
1 < i < m. 

Clearly, the complementary classes of bipartite, line-perfect, and triangulated 
graphs admit the corresponding co-perfect edge orders. For one class of perfect 
(but not Meyniel) graphs, both a perfect and a co-perfect edge order are known, 
namely, for weakly triangulated graphs. Let G be non-complete and weakly tri- 
angulated. Then a 2-pair x, y occurs in G due to a characterization of weakly 
triangulated graphs given by Hayward, Hoang, and Maffray in |S]. The graph 
G -|- xy is not only perfect by Lemma E] but still weakly triangulated by a result 
of Spinrad and Sritharan m- Consequently, we obtain a co-perfect edge order 
for weakly triangulated graphs by consecutively adding edges between 2-pairs 
(called 2-pair non-edge order in [17]). 
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Theorem 4. (Spinrad and Sritharan [HI) Every weakly triangulated graph G 
admits a co-perfect edge order (ei, . . . , e^) with G = Gq, Gi = Gi_i + such 
that the endnodes of Ci form a 2-pair in Gi-\ for 1 < i < ffi = \E(G)\. 

That the class of weakly triangulated graphs is closed under complementa- 
tion yields particularly a perfect edge order for every weakly triangulated graph. 
Hayward proved in jZ] that the following perfect edge order of a weakly triangu- 
lated graph G corresponds to the 2-pair non-edge order for G given by Spinrad 
and Sritharan [HI- 

Theorem 5. (Hayward |7]) Every weakly triangulated graph G admits a perfect 
edge order (ei,...,em) with G = Gq, Gi = Gi-i — Ci such that Ci is not the 
middle edge of any P 4 in Gi for 1 < i < m = \E{G)\. 

Weakly triangulated graphs are defined as a common generalization of tri- 
angulated and co-triangulated graphs. Hence, the above theorems provide also 
a co-perfect edge order for triangulated graphs and a perfect edge order for 
co-triangulated graphs. In addition, the 2-pair non-edge order for weakly trian- 
gulated graphs enables us to establish a co-perfect edge order for bipartite graphs 
G: either there are non-adjacent nodes a, b in different color classes of G and 
G-|- is still bipartite, or G is as complete bipartite graph weakly triangulated. 

At present, perfect or co-perfect edge orders are not known to the author 
for other subclasses of perfect graphs. However, we can ensure that no such 
edge order exists for some classes C of perfect graphs: If C contains a critically 
(anticritically) perfect graph G such that deleting (adding) an arbitrary edge 
yields an imperfect graph (see [20] for more details), then G is in particular 
critical (anticritical) with respect to C and there is no perfect (co-perfect) edge 
order for the graphs in C. 





(a) (b) 

Fig. [Sj (a) critically and anticritically perfect graph, (b) anticritically perfect 

graph. 



The graph depicted in Fig. [31(a) is critically and anticritically perfect and 
belongs to the classes of F-free Berge graphs where F is a claw, a diamond, or 
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a Ki. Hence, that graph is also (anti)critical with respect to those classes and 
there are, therefore, neither perfect nor co-perfect edge orders for those classes. 
The graph in Fig. Hb) is anticritically perfect, strongly perfect, strict quasi 
parity and quasi parity. Consequently, there are no co-perfect edge orders for all 
strongly perfect, all strict quasi parity, or all quasi parity graphs. In particular, 
there is no way to construct all the graphs belonging to those classes of perfect 
graphs by consecutively adding edges. 
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